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(57) Abstract 

The present invention relates to the three- 
dimensional structures of a protein tyrosine kinase 
optionally complexcd with one or more compounds. 
The atomic coordinates that define the structures of 
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the three-dimensional structures of protein tyrosine 
kinases with unknown structure and to methods 
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CRYSTAL STRUCTURES OF A PROTEIN TYROSINE KINASE 

EEL ^ EEB _^PPTiTr , ATIONS 
This application is related to U.S. Application 
Serial No. 08/701,191, by Mohammadi, et al . . entxtle 
..Crystals of the Tyrosine Kinase Domain of Non-Insulin 
Heceptor Tyrosine Kinases," filed August 21, 1996 «*« 
. Jon Docket No. 227/033) and U.S. Ration Ser.al 
No . 60/034,168, by McMahon. et al . , entitled "Crysta 
StrU ctures of a Protein Tyrosine Kinase Complexed wxth 
Compounds of the Oxindolinone/ Thiol indolinone Fam.ly, 
filed December 19, 1996 (Lyon * Lyon Docket No 
221/232) , which are hereby incorporated herein by 

•« .heir entirety including any drawings, 
reference m their enuixcuy 

tables, and figures. 

TNTP n P TTrTIOM 
The present invention relates to the three 
dimensional structures of protein kinases. 

The following description of the background of the 
invention is provided simply as an aid in understanding 
the invention and is not admitted to describe or 
constitute prior art to the invention. 

Protein tyrosine kinases (PTKs) comprise a large 
and diverse class of enzymes (for a review, see 

* Ullrich 1992, Neuron 9: 383-391). The 
Schlessinger and Ullricn, ^ 

PTK family contains multiple subfamilies, one of whtoh 
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is the fibroblast growth fan-o 

factor receptor (FGF-R) 

subfamily (f or , ^ . r w R ; 

» P - P :::e7i:r ticaiiy transfer a h - h — 

residue in a L triPh ° SPhate E ° * 

« ; l lu p r:: These — — 

-*-yuc*.i is then conver^prl -i^*- 
response. p TKs a.. 9 cellu ^r 

PTKs are components in many si 

transduction processes. A PTK re , 9 

—i in . particuiar st Ap 2 th the flow of a 

nhn , Step ln the Process by 

Phosphorylating a downstream molecule The hh ■ • 

a Phosphate can e^H addition of 

can either modulate the activity of th. 
downstream molecule bv t u ™ i 

Cy tu ming it »on" or "off- Th 
aberrations in a Da ^^ . ' hus ' 

" a particular PTK's activity can eirh. 
cause overflow or underflow of the ■ / 
a signal can lead to such b ° f 
cell Proli feratio ;° wh S I - uncontrolled 

borders as cance and a ^ ««* 

cer and angiogenesis 

- m m . in 77™ — - 

patients. ^ admini ««ed oraily to 

Some small molecule PTK 

^e PTK inhibitors have already 
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been discovered. For example, bis (monocyclic) , bicyclic 
or heterocyclic aryl compounds (PCT WO 92/20642) , 
vinylene-azaindole derivatives (PCT WO 94/14808) # 1- 
cyclopropyl-4-pyridyl-quinolones (U.S. Patent No. 
5,330,992), styryl compounds (U.S. Patent No. 
5,217,999), styryl-substituted pyridyl compounds (U.S. 
Patent No. 5,302,606), certain quinazoline derivatives 
(EP Application No. 0 566 266 Al) , seleoindoles and 
selenides (PCT WO 94/03427) , tricyclic polyhydroxylic 
compounds (PCT WO 92/21660) , and benzylphosphonic acid 
compounds (PCT WO 91/15495) are described as PTK 
inhibitors . 

Although many PTK inhibitors are known, many of 
these are not specific for PTK subfamilies and will 
therefore cause multiple side-effects as therapeutics. 
Compounds of the indolinone family, however, are 
specific for the FGFR subfamily and are non- 
hydrolyzable . WO 96/40116, "Indolinone Compounds for 
the Treatment of Disease," published December 19, 1996, 
inventors Tang et al . Although the use of X-ray 
crystallography has provided three dimensional 
structures of other PTKs, they are not complexed with 
PTK subfamily specific, hydrolysis resistant, small 
molecules. 

Despite recent advances, the need remains in the 
art for crystallographic analysis of protein kinases, so 
that improved therapeutic molecules can be designed and 
synthesized. 



SUMMARY OF THE INVENTION 
The present invention relates to the three 
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dimensional structures of protein tyrosine kinases 
use of X-ray crystallography can define the three 
dimensional structure of protein tyrosine kinase at 
atomic resolution. 

The three dimensional structures described herein 
elucidate specific interactions between protein tyrosine 
kinases and compounds bound to them. The coordinates 
that define the three dimensional structures of protein 
tyrosine kinases are useful for determining three 
dimensional structures of PTKs with unknown structure. 
In addition, the coordinates are also useful for 
designing and identifying modulators of protein tyrosine 
kinase function. These modulators are potentially 
useful as therapeutics for diseases, including (but 
limited to) cell proliferative diseases, such as cancer, 
angiogenesis, atherosclerosis, and arthritis. 

Thus in a first aspect, the invention features a 
crystalline form of a polypeptide corresponding to the 
catalytic domain of a protein tyrosine kinase. 

The term "crystalline form, - in the context of the 
invention, is a crystal formed from an aqueous solution 
comprising a purified polypeptide corresponding to the 
catalytic domain of a PTK. A crystalline form of a 
protein tyrosine kinase is characterized as being 
capable of diffracting x-rays in a pattern defined by 
one of the crystal forms depicted in Blundel et al . , 
1976 ' Protein Crystall o graphy , Academic Press. A 
crystalline form of a protein kinase is not 
characterized as being capable of diffracting x-rays in 
a pattern analogous to a crystalline form consisting of 
primarily salt or primarily a compound, for example. 
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» or PTK, refers 

«r.vot-ein tyrosine kinase, 

The «™ v energy phosphace o£ 

to an enryme that trans on 
adenosine triphosphate to a tyr 

a protein target . „ r ,ivtic domain o£ the 

^ * p - tein t 9 ::i rrieii ^ 

— -iast growth £ aotor lW » . — 

kl nases that hi ^ ^ herein , and 

protein tyrosine Kinases ^ ^ 

can relate to one member of the FGF 

10 FGFR1 ' -catalytic domain" refers to the region of 

The term catalyt ^ 

. protein that can exrst as P 

The catalytic domain of a protein y 
P characterized as having considerable ammo 

kinase is characters other protein 

^ „,*. a ivtic domain of ocner v 

15 ^ d "^1: — Xe amino acid .entity 
tyr osme kinases. ^ ^ 

prefera :; y iriiLrs - - — y at 

pr eferably at least ^ ^ ^ 

— 4o% r/r, J -tity ™ n — - 

20 identity refer to t ^ fQr 

tyr osine kinase families ^ ^ ^ ^ 

members of a given protein y be funct ional 

* ^ %to 90%. The catalytic domain may be 

from SSs to . m of a protein 

. .nt-itv The catalytic domain or a v 
as a separate entity, in polypeptide 
• Hnase is also characterized as a polyp P 
25 tyrosine kinase is» 

that is soluble in solution. 

The term "identity- identity as used here 
to a property of seances ^ ^ by 
similarity or relati onshr. *» V ^ ^ _ 
30 dividing the number of rdentrcal 

sequences by the total number of resrd 
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of, amino acids comprising a protein. 

A preferred embodiment of the invention includes a 
crystalline form of a PTK that is a receptor PTK . 
Receptors are proteins that straddle the inside and 
5 outside of the cell membrane. Receptor PTKs comprise an 
extracellular region, a transmembrane region, and an 
intracellular region comprising a catalytic domain. 

Another preferred embodiment of the invention is 
the crystalline form of a receptor PTK selected from the 
0 group consisting of FGF-R, PDGF-R, FLK, CCK4 , MET, TRKA, 
AXL, TIE, EPH, RYK, DDR, ROS, RET, LTK, R0R1 , and MUSK . 

Yet another preferred embodiment of the invention 
is the crystalline form of a PTK that is a non-receptor 
PTK. Non-receptor PTKs are located inside the cell and 
5 do not harbor extracellular or membrane -spanning 

polypeptides attached to the polypeptide corresponding 
to the catalytic domain. Non-receptor PTKs may harbor 
fatty acids or lipids, which can impart a membrane 
associated character to a PTK. In preferred embodiments 
3 of the invention, crystalline forms of non-receptor PTKs 
are selected from the group consisting of SRC, BRK, BTK, 
CSK, ABL, ZAP70, FES , FAK, JAK, and ACK. 

In still another preferred embodiment, the 
invention features a crystalline form of a PTK that 
3 comprises a heavy metal atom. These types of crystals 
can be referred to as derivative crystals. 

The term "derivative crystal" refers to a crystal 
where the polypeptide is in association with one or more 
heavy-metal atoms . 
) The term "association" refers to a condition of 

proximity between a chemical entity or compound, or 
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In a preferred embodiment th. 

of an fgf receptor t """"^ fe " Ures * 

The pep r yr ° Sine ^m ain 

" re -l-ate to PGPRi . 
The term "FGFRi" refers 

receptor PT Ks that arp h ° ^ member ° f ™ltiple 

tnat are homoloqous t-r. ~ 
bind FGF . In th . 9 US t0 one another and 

criis context 4 

- « - ast 70% amino aci :; c ;;v homoio9ous " ref - 

of the FGPR family ldent ">' b "weer, two members 

The term - FGFR1 „ can also 

hman FGFR1 -** is ch a r a cteri 2 ed L ' " 
sequence of seq ID N0 ., a ^ by the ami "° *cid 

contains the folio C ° mPared '° """^ P ° PR1 - 

«*--«.. - Ala , 1 ;: r° acid 

a°°ition a l five am . ,' er ' LeU " 4 " " v al, and has an 

<ser- Ala - A1 a. G1 ;;;;. acid res ^ « «- 
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-naae dom ain of human *> Cosine 

1 havi ng the amino a " 9r ° Wth fa «- receptor 

amino ao ld sequence of SEQ ID 
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NO:l. Generally, human FGFR1 comprises a 310 amino acid 
residue fragment (residues 456 to 765) of human FGFR1 . 

The term "mutant" refers to a polypeptide which is 
obtained by replacing at least one amino acid residue in 
5 a native tyrosine kinase domain with a different amino 
acid residue. Mutation can be accomplished by adding 
and/or deleting amino acid residues within the native 
polypeptide or at the N- and/or C-terminus of a 
polypeptide corresponding to a native tyrosine kinase 

10 domain having substantially the same three-dimensional 
structure as the native tyrosine kinase domain from 
which it is derived. By having substantially the same 
three-dimensional structure is meant having a set of 
atomic structure coordinates that have a root mean 

15 square deviation (r.m.s.d.) of less than or equal to 
about 2 A when superimposed with the atomic structure 
coordinates of the native tyrosine kinase domain from 
which the mutant is derived when at least about 50% to 
100% of the Ca atoms of the native tyrosine kinase are 

2 0 included in the superposition. A mutant may have, but 
need not have, PTK activity. 

In another preferred embodiment, the invention 
relates to a crystalline form defined by the structural 
coordinates set forth in Table 1 . 

25 The term "atomic structural coordinates" as used 

herein refers to a data set that defines the three 
dimensional structure of a molecule or molecules. 
Structural coordinates can be slightly modified and 
still render nearly identical three dimensional 

30 structures. A measure of a unique set of structural 
coordinates is the root -mean- square deviation of the 
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resulting structure, structural coordinates that render 
three dimensional structures that deviate from one 
another by . root -mean-sguare deviation of less than X 5 
may be viewed hy . person o£ ordiMry ^ ^ ^ ^ 

as identical. Hence, the structural coordinates set 

limited to the values defined therein. 

m other preferred embodiments, the invention 
features a crystalline form of the polypeptide in 
association with a compound. These types of crystalline 

rorms can be referrpH ^ 

j-^i-errea to as co- crvsta 1 q t l„ 

iybcais ' The compound 

may be a cofactor, substrate, substrate analog 
inhibitor, or allosteric effector. 

The term "compound- refers to an organic molecule 
The term -organic molecule" refers to a molecule which 
has at least one carbon atom i n iCs structure The 
compound can have a molecular weight of less than 6kDa 
Both the geometry of the compound and the interactions 
formed between the compound and the polypeptide 
Preferably govern high affinity binding between the two 
-lecules. High affinity binding is preferably governed 
by a dissociation equilibrium constant on the order of 
10 6 M or less. The compound is preferably a modulator 
that alters the function of a PTK. 

The term "function," in reference to the effect of 
a modulator on PTK function, refers to the ability of a 
modulator to enhance or inhibit the catalytic activity 
of a PTK. y 

The term "catalytic activity", in the context of 
the invention, defines the ability of a PTK to 
phosphorylate a substrate polypeptide. Catalytic 
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activity can be measured, for example, by determining 
the amount of a substrate converted to a product as a 
function of time. The conversion of the substrate to a 
product occurs at the active-site of the PTK . 
5 The term w active-site" refers to a cavity located 

in the PTK in which one or more substrate molecules may 
bind. Addition of a modulator to cells expressing a PTK 
may enhance (activate) or lower (inhibit) the catalytic 
activity of the PTK. 
0 A small number of inhibitors of PTK catalytic 

activity are known in the art. Small molecule 
inhibitors may modulate PTK function by blocking the 
binding of substrates. Indolinone compounds, for 
example, may bind to the active-site of PTK catalytic 
domains and inhibit them effectively, as measured by 
inhibition constants on the order of lCT 6 M or less. 

Activators of PTK intracellular regions can enhance 
PTK function by interacting with both the PTK catalytic 
domain and the substrate. Activators may also promote 
dimerization of PTKs and thus activate them by bringing 
them into close proximity with one another. In 
addition, activators may operate by promoting a 
conformational change in the intracellular region of the 
PTK such that the catalytic region modifies substrates 
at a faster rate in the presence of the activator. 

The term n function" can also refer to the ability 
of a modulator to enhance or inhibit the association 
between a PTK and a natural binding partner. 

The term u natural binding partner" refers to a 
polypeptide that normally binds to a PTK in a cell. 
These natural binding partners can play a role in 
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P t.xa kmases or protein phosphatases. 
The term "interactions" refers fn h ^ 
aromatic an rt ■ meters to hydrophobic, 
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site. the enz yme active- 
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undergo a ,h» ■ , * Protein and 

9 S che » 1 « 1 reaction. Multip le co- facto 
nucieotides or nuoieotide derivatives h 
« -cotin^ide der.vatives o £ IZ s i„7 " 

reacti"::: " substrate " refers c ° a — «~ 
~ .:r;:;r 

-tratee f or ^ ^ ~ ~ - 

The term "substrate analog" refers to „ 
^at is structurally similar b , 
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The term "inhibitor" refers to a compound that 
decreases the cellular function of a protein kinase. 
The protein kinase function is preferably the 
interaction with a natural binding partner and more 
5 preferably catalytic activity. 

The term "allosteric effector" refers to a compound 
that causes allosteric interactions in a protein. The 
term "allosteric interactions" refers to interactions 
between separate sites on a protein. The sites can be 
10 different from the active site. The allosteric effector 
can enhance or inhibit catalytic activity by binding to 
a site that may be different than the active site. 

The term "co-crystal" refers to a crystal where the 
polypeptide is in association with one or more 
15 compounds . 

In preferred embodiments, a co-crystal of the 
invention can be in association with a heavy metal atom. 
Examples of heavy metal atoms are described above. 
In other preferred embodiments, the invention 
20 features a co-crystal comprising the crystalline form of 
the polypeptide in association with a compound, where 
the compound is a non-hydrolyzable analog of ATP. These 
analogs can be referred to as nucleotide analogs. 

The term "ATP" refers to the chemical compound 
25 adenosine triphosphate. 

The term "non-hydrolyzable" refers to a compound 
having a covalent bond that does not readily react with 
water . Examples of non-hydrolyzable analogs of ATP are 
AMP-PNP and AMP-PCP, whose structures are well known to 
30 those skilled in the art. 

The term "AMP-PNP" refers to adenylyl 
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or a pharmaceutically acceptable salt, isomer, 

metabolite, ester, amide, or prodrug thereof, where: 

(a) A lr A 2 , A 3/ and A 4 are independently carbon or 
5 nitrogen; 

(b) Rj is hydrogen or alkyl; 

(c) R 2 is oxygen in the case of an oxindolinone or 
sulfur in the case of a thiolindolinone ; 

(d) R 3 is hydrogen; 

10 (e) R 4 , R 5 , R 6/ and R 7 are optionally present, and 

are either (i) independently selected from the group 
consisting of alkyl, alkoxy, aryl, aryloxy, alkaryl, 
alkaryloxy, halogen, trihalomethyl , S(0)R, SO ? NRR 1 , S0 3 R, 
SR, N0 2/ NRR ' , OH, CN, C(0)R, OC(0)R / NHC(0)R, (CH 2 ) n C0 2 R, 

15 and CONRR 1 or (ii) any two adjacent R 4/ R 5 , R 6/ and R 7 

taken together form a fused ring with the aryl portion 
of the indole-based portion of the indolinone; 

(f) R 2 ', R 3 ', R 4 1 , R 5 ', and R 6 1 are each 
independently selected from the group consisting of 

2 0 hydrogen, alkyl, alkoxy, aryl, aryloxy, alkaryl, 

alkaryloxy, halogen, trihalomethyl, S(0)R, S0 2 NRR*, S0 3 R, 
SR, N0 2 , NRR', OH, CN , C(0)R, 0C(0)R, NHC(0)R, (CH 2 ) n C0 2 R f 
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and CONRR ' ; 

( 9> n is 0, i, 2 , or 3; 

(h) R is hydrogen, alkyl or aryl; 

(i) R' is hydrogen, alkyl or aryl ; and 

fro m the " ^ 3 fiVG membSred h6ter0aryl rtn * -1-cted 
the group consi sting of thiophenS/ 

o" r ::r e '- imida2oie ' — i.*.,.^.. 

oxazole, lsoxa2ole , i-o ' 

oxadiazole l 9 a 1,2,3- 

xe ' - 1 ' 2, 4 -oxadiazole i •? c ^ ^ . 

e ' x ' 2 > 5 -oxadiazole i ^ /i 
oxadiazole ] r> ■* „ . 1,3,4- 

ie ' 1 ' 2 ' 3 '4-oxatriazole l ? ■» R 
i 2 . ... ' 1 ' 2 ' 3 ' 5 -oxatriazole, 

ie ' 1 ' 2 '4-tmadiazole l ? c ^ . 

13 4 f-h-;^- n 2 ' 5-thiadiazole, 

1. . th ladia2ol e, ,,,3,,.^^ 3 

th.atr^ole, and tecra2Qlei optionai ^ 

one „ positlMs ^ - a 

a»ar yl , alkaryloxy , halogen trihalQmet / 

S°^,SO,R. SR, N0 2 , NRR ' , OH, CN, C ( 0,R OCOR 
«HC(0)R. <CH !)n C0 2 R or CONRR ' 

The ter m ..p harmac eutica lly ac eept ab le aalc „ rfi£ers 
to those salts which recain che Moiogicai - 

P,ope rties of the £re e bases . Ph a r«ic ally 

ao=ep tabl e s al t a oan be obtaine(J by 

« such as ^^^^^^ ^ ^ 

"id. sul£uric acid nUric ^ phosphoric 
-thenesulfonic acid, ethenesulfonic a cid „. 

t0lUeneSUlf ° niC -licylie acid and che like 

™» ter„ -prodrug" refers to an agent ^ ^ 

rir:: ; d nco the parenc dru9 ±n — ■ — - 

6 ° adminiStSr tha « "» Parent drug in som e 

" Uatl ° nS - P ° r the prodrug may be 

bi o a vaU able by oral administratiM 

not. or the prodrug may improve solubiUty co ^ ^ 
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intravenous administration . 

"Alkyl M refers to a straight -chain, branched or 
cyclic saturated aliphatic hydrocarbon. Preferably, the 
alkyl group has 1 to 12 carbons. More preferably, it is 
a lower alkyl of from 1 to 7 carbons, more preferably 1 
to 4 carbons. Typical alkyl groups include methyl, 
ethyl, propyl, isopropyl, butyl, isobutyl, tertiary 
butyl, pentyl, hexyl and the like. The alkyl group may 
be optionally substituted with one or more substituents 
are selected from the group consisting of hydroxy 1 , 
cyano, alkoxy, =0, =S, N0 2 , halogen, N(CH 3 ) 2 amino, and 
SH. 

"Alkenyl" refers to a straight -chain, branched or 
cyclic unsaturated hydrocarbon group containing at least 
one carbon-carbon double bond. Preferably, the alkenyl 
group has 2 to 12 carbons. More preferably it is a 
lower alkenyl of from 2 to 7 carbons, more preferably 2 
to 4 carbons. The alkenyl group may be optionally 
substituted with one or more substituents selected from 
the group consisting of hydroxyl, cyano, alkoxy, =0, =S, 
N0 2 , halogen, N(CH 3 ) 2 amino, and SH. 

"Alkynyl" refers to a straight -chain, branched or 
cyclic unsaturated hydrocarbon containing at least one 
carbon-carbon triple bond. Preferably, the alkynyl 
group has 2 to 12 carbons. More preferably it is a lower 
alkynyl of from 2 to 7 carbons, more preferably 2 to 4 
carbons. The alkynyl group may be optionally 
substituted with one or more substituents selected from 
the group consisting of hydroxyl, cyano, alkoxy, =0, =S, 
N0 2 , halogen, N(CH 3 ) 2 amino, and SH. 

"Alkoxy" refers to an "O-alkyl" group. 
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least oT^ " an ar ° maCiC ^ - at 

least one ring having a con ju gated pi-electron 

and i„cl udes carbocyclic aryl het yStem 
W«yl groups. The aryl " ' h — «yi and 

yl 9rou P "» a y be optional )v 
' ^stitutea with one or m ore substitute I ed £ 

-e group consisting ot hal o 9 en, t^^^ ' f ™ 
hydroxyl, SH OH wn 

' ° H ' N0 *' a "iine, thioether CVa nn 
•Ikyl. and amino. 7 n °' alkox ^ 

.oinedf 1 " ^ ^ " kyl ^ - — ly 

"Carbocyclic aryl- re f ers to an arvl 

"Heterocyclic aryl" referc t-o a 
from i to 3 her. n aryl grou P h ^ing 

the rin. heter ° at ° mS " — the remainder of 

oxl ^ b6ing Carb ° n - "—atoms include 

oxygen, sulfur, and nitrogen Th ua K 

and the like. PXrazmyl, inudazolyi 

••Amide" refers to -C(0)-NH r u 

H_R ' where R i s a lk-vi 
aryl, alkylaryl or hydrogen alk ^ 

"Thioamide- refers to - C f<3) mu „ 
a^l. al kylar yl or h y drogen where R is alItyl , 

J*-." -ers eoa. H(R , )R ,, group _ her 

are independently selected fro m 

of alkyl arvl „ \ * 9r ° UP cons ^ting 

yi ' ar yl, and alkylaryl. 

"Thioether- refers to -S-p „», 

t>-R, where R i s aii,ui -, 
or alkylaryl. alkyl, aryl, 

"Sulfonyl" refers to -s(o) r u 
C(C N)=C -aryl CH CN , llr ^) 2 -R, where R is aryl, 

CH 3 CN, alkyaryl, sulfonamide, m . alkyl/ m _ 
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alkylaryl, or NH-aryl. 

The term "acyl" denotes groups -C(0)R, where R is 
alkyl as defined above, such as f ormyl , acetyl, 
propionyl , or butyryl . 
5 It is understood by those skilled in the art that 

when A lf A 2 , A 3 , and A 4 are nitrogen or sulfur that the 
corresponding R 4 , R 5 , R 6/ and R 7 , as well as the 
corresponding bond, do not exist. 

Examples of indoles having such fused rings (as 
10 described in (e) (ii) above include the following: 





The six membered rings shown above exemplify 
possible A rings in compound II. 
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Other preferred embodiments of the invention are 
crystalline forms comprising 3 - [ (3 - ( 2 -carboxyethyl ) -4 - 
methyl P yrrol-5-yl)methylene]-2-indolinone as well as 3- 
[4- (4-formylpiperazine-l-yl-)benzylidenyl] - 2 - indolinone . 
The polypeptide of these crystalline forms can be FGFR , 
and specifically, FGFRl. 

In preferred embodiments, the crystalline forms of 
the invention can be defined by the structural 
coordinates set forth in Table 3 or Table 4. 

The use of X-ray crystallography can elucidate the 
three dimensional structure of crystalline forms of the 
invention. The first characterization of crystalline 
forms by x-ray crystallography can determine the unit 
cell shape and its orientation in the crystal. 

In other preferred embodiments, the invention 
features a crystal of an FGF receptor tyrosine kinase 
domain protein, where the crystal is characterized by 
having monoclinic unit cells. The crystal may also be 
characterized by having space group symmetry C2 . 

The term "unit cell" refers to the smallest and 
simplest volume element (i.e., parallelpiped- shaped 
block) of a crystal that is completely representative of 
the unit of pattern of the crystal. The dimensions of 
the unit cell are defined by six numbers: dimensions a, 
b and c and angles a, fj and y . A crystal can be viewed 
as an efficiently packed array of multiple unit cells. 
Detailed descriptions of crystallographic terms are 
described in, which is hereby incorporated herein by 
reference in its entirety, including any drawings, 
figures, and tables. 

The term "monoclinic unit cell" refers to a unit 
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cell where a^b^c; a = y = 90°; and (3 > 90°.. 

The term "space group" refers to the symmetry of a 
unit cell. In a space group designation (e.g., C2) the 
capital letter indicates the lattice type and the other 
5 symbols represent symmetry operations that can be 
carried out on the unit cell without changing its 
appearance . 

The term "lattice" in reference to crystal 
structures refers to the array of points defined by the 
10 vertices of packed unit cells. 

The term "symmetry operations" refers to 
geometrically defined ways of exchanging equivalent 
parts of a unit cell, or exchanging equivalent molecules 
between two different unit cells. Examples of symmetry 
15 operations are screw axes , centers of inversion , and 
mirror planes. 

In a preferred embodiment, the invention features a 
crystalline form, where the monoclinic unit cells have 
dimensions of about a=208.3 A, b=57.8 A, c=65 . 5 A and 
20 3=107.2°. 

In a preferred embodiment, the invention features a 
FGFR1 crystal, where the monoclinic unit cells have 
dimensions of about a=211.6 A, b=51.3 A, c=66.1 A and 
3=107.7°. 

25 In another aspect the invention features a 

polypeptide corresponding to the catalytic domain of a 
protein tyrosine kinase, containing at least about 20 
amino acid residues upstream of the first glycine in the 
conserved glycine-rich region of the catalytic domain, 

30 and at least about 17 amino acid residues downstream of 
the conserved arginine located at the C-terminal 
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boundary of the catalytic domain 

The polypeptides of the 
enriched or purified , ^J™^™ «" b * Elated, 
forms of the "° n ' ^ 

• a 0 ™" ^ " '~ 

isolated, enriched, or purified. 

B V "isolated" in -*-~,r 

-«t a p olymer ^ " <° " -peptide is 

con^ated to each oh' " " ami " 0 

are isolated from a 9 POl ^«"es that 

synthesi^d i: v r urai source ° r - e 

— tion aretinrr 1 ^"^ * ~ 

X ^ ue in the senqp . . 

'o- in a pure or separated state "* 

«- t« .isolated, indicates hi °"* °* 

-ence has been removed ^ LI^™ 9 

en ; lr ° nment - ^ se^ence may he in 

-lutlon or placed in a different cellular e 

™e term does not imply that ^ 

amino acid chain SeqUenCe iS Che ° nl y 

chain present, but that- it. ■ 

free (about 90 - 9S » „ 1S esse ntially 

95, pure at least) of material 

naturally associated with it . 

By the use of the t-^>-m 

urie term "enrichpH" ^ 

Peptide it is meant that the specific " 8 

sequence constit„t s Pocific amino acid 

" ■ fold, a ; o 7""-«V hi g her fraction „ 

- or -X^-TjlrL" 01- " PreSent " 

™n rest than m normal or dise a «oH 

cells or in th P t ^ aiseased 

could b a e b ^ ^ S ~* ~ 
reduc, ■ ■ ^ * PSrSOn by P"farential 

reduction in the amount of other a „, 

*V a preferential increase L th " 

Liease m the amount of t-v^ 

amino acid seguence of interest or h SPeC1 " C 

the two. „ oweV e r it h ' ' ^ " COmbi " a «°n of 

' " Sh ° UW be n °^o that -enriched" 
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does not imply that there are no other amino acid 
sequences present, just that the relative amount of the 
sequence of interest has been significantly increased. 
The term significant here is used to indicate that the 
5 level of increase is useful to the person making such an 
increase, and generally means an increase relative to 
other amino acids of about at least 2 fold, more 
preferably at least 5 to 10 fold or even more. The term 
also does not imply that there are no amino acids from 

10 other sources. The other source amino acids may, for 
example, comprise amino acids encoded by a yeast or 
bacterial genome, or a cloning vector such as pUC19. 
The term is meant to cover only those situations in 
which a person has intervened Lo elevate the proportion 

15 of the desired nucleic acid. 

It is also advantageous for some purposes that an 
amino acid sequence be in purified form. The term 
"purified" in reference to a polypeptide does not 
require absolute purity (such as a homogeneous 

20 preparation) ; instead, it represents an indication that 
the sequence is relatively purer than in the natural 
environment (compared to the natural level this level 
should be at least 2-5 fold greater, e.g., in terms of 
mg/ml) . Purification of at least one order of 

25 magnitude, preferably two or three orders, and more 

preferably four or five orders of magnitude is expressly 
contemplated. The substance is preferably free of 
contamination at a functionally significant level, for 
example 90%, 95%, or 99% pure. 

30 In a preferred embodiment, the invention features a 

polypeptide corresponding to the catalytic domain of a 
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receptor PTK . The receptor pTK a thr ^_ 

a.mensionai structure substantially similar to 

the r„ su i ln receptor, even though ^ am 

may be different. content 

m a preferred embodiment, the invention features a 
Po ypept.de corresponding to the catalytic domain of a 
non-receptor PTK, where tne non-i„ S ulin reoeptor 
tyrosine kinase is a cytoplasmic tyrosine kinase 

In a preferred embodiment, the invention features a 
- ypeptide oorrespondin, to the catalytic domain o 
«o ptor PTK, seiected from the g roup consisting of FGP 
«. «»-». KDR, CCK4. MET, TRKA, AXL, TIE, Ep„ R « 

in a preferred embodiment, the invention features a 
PO ypeptrde oorrespondin, to the catalytic domain of 
non. e r PTK, seieoted from the group oonsisti„ g of 

in a preferred embodiment, the invention features a 
peptide oorrespondin, to the catalytic domain of 

HIT" 3 ami "° a ° id SeqUenM *» I"able i or 

for cltr"" ^ ^ inVanti ° n * — 

tor creatrn 9 crystalline forms described herein. The 

-thod may utilise the poiypeptides described herein to 
form a crystal. The method comprises the steps of- 

(a, mixing a volume of polypeptide solution 
with a reservoir solution; and 

(W incubating the mixture obtained in step 
«.) over the reservoir solution in a closed container 
-der conditions suitable for cryatalli.ation 

These processes are described in detail in the 
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section entitled "Detailed Description of the 
Invention . " 

In another aspect, the invention features a method 
of obtaining FGF receptor tyrosine kinase domain 
5 polypeptide in crystalline form, comprising the steps 

of: (a) mixing a volume of polypeptide solution with an 
equal volume of reservoir solution, where the 
polypeptide solution comprises 1 mg/mL to 6 0 mg/mL FGF- 
type tyrosine kinase domain protein, 10 mM to 200 mM 
10 buffering agent, 0 mM to 20 mM dithiothreitol and has a 
pH of about 5.5 to about 7.5, and where the reservoir 
solution comprises 10% to 30% (w/v) polyethylene glycol, 
0 . 1 M to 0 . 5 M ammonium sulfate, 0% to 20% (w/v) 
ethylene glycol or glycerol, 10 mM to 200 mM buffering 

15 agent and has a pH of about 5.5 to about 7.5; and (b) 
incubating the mixture obtained in step (a) over said 
reservoir solution in a closed container at a 
temperature between 0° and 25°C until crystals form. 

In a preferred embodiment, the invention features a 

20 method of obtaining FGF receptor tyrosine kinase domain 
polypeptide in crystalline form, where the polypeptide 
solution comprises about 10 mg/mL FGF receptor tyrosine 
kinase domain, about 10 mM sodium chloride, about 2 mM 
dithiothreitol, about 10 mM Tris-HCl and has a pH of 

25 about 8; the reservoir buffer comprises about 16% (w/v) 
polyethylene glycol (MW 10000), about 0.3 M ammonium 
sulfate, about 5% ethylene glycol, or glycerol, about 100 
mM bis-Tris and has a pH of about 6.5; and the 
temperature is about 4°C. 

30 in another preferred embodiment, the invention 

features a method of obtaining FGF receptor tyrosine 
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kinase domain polynent- i ,■ 

t^ypeptide i n crystalline f n ™ 

In still another prefers u 

prererred embodiment th#» 

Kinase domain polypeptide i n crystal! s * 
wherp 4_ _ u:r ystalline form 

wnere the compound is a nucleot-ir^ , 

=o mPOUnds have th r gen : r ; indounone - 

« c, eKribed herein genSral structural formula 

In another asoect- • 
enco din9 an FGP r j; C —3 a CDNA 

"n.re a codin g stran T f \ MnaSe d0mai " 

xng strand of the cDNA h-.o #-u 

Another aspect of the invention relates to . „ 
of aeter mi n ing three dimensional structure 
untaown structure b y utili2ina ° f -ith 

„„ .. y 1I121n 9 the structural 

coordinates of Table i, TaMe 2 ... , 

These methods can relate to H , 

replacement and , ^ m ° delin9 ' ^« 

-»*. and nuclear magnetic resonance methods 

in a preferred embodiment ph„ • 

a method of dete • • invention relates to 

PTKs 1 dete ™"« 9 three dimensional structures of 

KS ~ n -ructures b y uti lizin3 the 

conjunction with th~ ■ Table 4 m 

n with the ammo acid sequences of PTKs th ■ 
method of homology modeling hlS 

modeling comprises the steps of- r al 
aligning the computer representation of an 
sequence of a ptk w^h , ln ° aCid 

or a ptk with unknown structure wihh -v, «. 

PTK with known structure, where ■ ° £ 9 

etching homologous regi ns of th " » 

9 of th e amino acid seguences; 
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(b) transferring the computer representation of an amino 
acid structure in the PTK sequence of known structure to 
a computer representation of a structure of the 
corresponding amino acid in the PTK sequence with 

5 unknown structure; and (c) determining low energy 
conformations of the resulting PTK structure. 

The term "amino acid sequence" describes the order 
of amino acids in the amino acid chain comprising a 
polypeptide corresponding to the catalytic domain of a 

0 PTK . 

The term "aligning" describes matching the 
beginning and the end of two or more amino acid 
sequences. Homologous amino acid sequences are placed 
on top of one another during the alignment process. 
5 The term "homologous" describes amino acids in two 

sequences that are identical or have similar side-chain 
chemical groups (e.g., aliphatic, aromatic, polar, 
negatively charged, or positively charged) . 

The term "corresponding" refers to an amino acid 
0 that is aligned with another in the sequence alignment 
mentioned above . 

The term "determining the low energy conformation" 
describes a process of changing the conformation of the 
PTK structure such that the structure is of low free 
5 energy. The PTK structure may or may not have molecules, 
such as modulators bound to it. 

The term "low free energy" describes a state where 
the molecules are in a stable state as measured by the 
process. A stable state is achieved when favorable 
0 interactions are formed within the complex. 

The term "favorable interactions" refers to 
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hydropic, aromatic, and ionic £orces _ ^ 
bonds . 

Another preferred embodiment of the invention 
relates to a method of determining three dimensional 
structures of PTKs with unknown ^ 

» aocompiished by appiying che struccural coordinates 
of Table 1, Table 2, Table 3. or Table 4 to an 
^complete x-ray crystallography data set for a PTK 
The method comprises the steps of, <a, aligning the ' 
positions of atoms in the unit cell by matching electron 
diffraction data from two crystals, where one data set 
" C ° mPlete and th * other is incompiete; and ,b) 
determining a low energy conformation of the resulting 
PTK structure. 

The term "incomplete data set" relates to a x-ray 
crystallographic data set that does not have enough 
information to give rise to a three dimensional 
structure. 

In another preferred embodiment, the invention 
relates to a method of determining three dimensional 
structures of PTKs with unknown structure by applying 
the structural coordinates of Tabic i, Table 2. Table 3 
or Table 4 to nuclear magnetic resonance (HMR) data of a 
PTK. This method comprises the steps of- (a) 
determining the secondary structure of a PTK structure 
using NMR data,- and ( b) simplifying the assignment of 
through-space interactions of amino acids. The PTK 
structure may not be complexed with compounds or 
modulators . 

The term "secondary structure" describes the 
arrangement of amino acids in a three dimensional 
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structure, such as in a-helix or (3-sheet elements. 

The term "through-space interactions" defines the 
orientation of the secondary structural elements in the 
three dimensional structure and the distances between 
5 amino acids from different portions of the amino acid 
sequence . 

The term "assignment" defines a method of analyzing 
NMR data and identifying which amino acids give rise to 
signals in the NMR spectrum. 

10 In another aspect, the invention features a method 

of identifying potential modulators of PTK function. 
These modulators are identified by docking a computer 
representation of a structure of a compound with a 
computer representation of a cavity formed by the 

15 active-site of a PTK. The computer representation of 
the PTK active- site structure can be defined by 
structural coordinates . 

The term "chemical group" refers to moieties that 
can form hydrogen bonds, hydrophobic, aromatic, or ionic 

20 interactions. 

The term "docking" refers to a process of placing a 
compound in close proximity with a PTK. The term can 
also refer to a process of finding low energy 
conformations of the compound/PTK complex. 

25 A preferred embodiment of the invention is a method 

of identifying potential modulators of PTK function. 
The method involves utilizing the structural coordinates 
or a PTK three dimensional structure. The structural 
coordinates set forth in Table 1, Table 2, Table 3, and 

30 Table 4 can be utilized. The method comprises the steps 
of: (a) removing a computer representation of a PTK 
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structure and docking . CQmputer tepresentat 
co~ from . computer data uith a 

"T"™" 1 " ° £ aCti " e - Si - ° £ «- PTK; (b) 

determining a conformation of fh a 

ation ot the complex with a 

n 7 aMS - '—Me compiementary 

ruTT' ^ <C ' ident " yin9 c ~ s ^ -« 

function. The inittal pTR ^^^^ J 
compounds bound to it. aVe 

The term . £avorable geom£tric t , t . 
=on o „ Q£ the com _^ R ^ uhsre 

rrr.iv!; ;r ound is in — — 

area of the active-site without formi na 

—--3. unfavorable inceract 

ce ster lc hindrances between atoms in the 
atoms in the PTK active-site. 

reiaJsVT * , ' VOr ' W " interactions- 
relates to hydrophobic, aromatic, ionic, and hydrogen 

b :::r 3 ' and hyd ~ 9en bond *«- 

etween the compound and the PTK active -site 

of ptkTT " POt9ntial " 9Uali£i6S the te ™ 

°t PTK function- because the potentiai modulator 



The term "best fit-" 

flt Ascribes compounds that 

P K faV ° raWe ^ — —actions with 

PTK rn the screen in a given experiment. 

meth r ther Pre£erred emb ° di -« <* the invention is a 
-thod of identifying potentiai mediators o£ pTR 

function. The method invoives utiUzing a three 
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dimensional structure of a PTK, with or without 
compounds bound to it. The method comprises the steps 
of: (a) modifying a computer representation of a PTK 
having one or more compounds bound to it, where the 
5 computer representations of the compound or compounds 
and PTK are defined by structural coordinates; (b> 
determining a conformation of the complex with a 
favorable geometric fit and favorable complementary 
interactions; and (c) identifying the compounds that 
10 best fit the PTK active-site as potential modulators of 
PTK function. 

The term "modifying" relates to deleting a chemical 
group or groups or adding a chemical group or groups. 
Computer representations of the chemical groups can be 

15 selected from a computer data base. 

Yet another preferred embodiment of the invention 
is a method of identifying potential modulators of PTK 
function by operating modulator construction or 
modulator searching computer programs on the compounds 

20 complexed with the PTK. The method comprises the steps 
of: (a) removing a computer representation of one or 
more compounds complexed with a PTK; and (b) searching a 
data base for compounds similar to the removed compounds 
using a compound searching computer program, or 

25 replacing portions of the compounds complexed with the 
PTK with similar chemical structures from a data base 
using a compound construction computer program, where 
the representations of the compounds are defined by 
structural coordinates . 

30 The term "operating" as used herein refers to 

utilizing the three-dimensional conformation of 
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-lecules defined by che processes 

var ious computer programs. " 

The term "similar compound" refers to ■ 
a computer data base that h a compound in 

; . at has a similar geometric 

structure as compounds that can bin d to a PTK The 
a»iiar compound can aiso have simiiar chemical 
as the compounds that are either bo 1 
once bound to the PTK Th « 

*» complementar/ nteraclior 1 ";"" 1 ~ ~ 

j inLeractions with the PTK 

descrLV 6 ™ Sear ° hin9 *°°<*~ 

21 3 COmPUter Pr09 " m Ch « -arches computer 

~ a :;:r~ - * -~ - - 

cnree Qlmansional structures «nrf 
similar chemical groups as 

groups as a compound of interest n 
compound of interest is preferable 
compound. Preferably an mdolinone 

The term "similar chemical structures" refers to 

7 ~ — — r geometry as po t ns 

of the compounds in complex with the PTK or 
-moved from the PTK structure simila h 
structures can also refer to ch 

form simi, ohenucal groups that may 

form srmrlar compiementary interactions as portions of 
the compounds in compie* „ ich the pTK or e Cl ° nS ° f 
removed from the PTK structure. 

The term "replacing structures" refers to . • 
* -tion of the compounds in complex wi h t 
compounds removed from the PTK structure and c 
«- broken bonds to a similar ^Jl^^ 

The term "compound construction ~ 
describes , ^uctxon computer program- 

escribes a computer program that reclame 
r-^v. replaces computer 

representations of chemical groups i n a 

a ups ln a compound with 
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groups from a computer data base. The compound is 
preferably an indolinone compound. 

The term "similar three dimensional structure" 
describes two molecules with nearly identical shape and 
5 volume . 

In another preferred embodiment of the invention, 
the PTK structures used in the modulator design or 
identification method of the invention are defined by 
the structural coordinates of Table 1, Table 2, Table 3, 

10 or Table 4 . 

The methods for using the crystalline forms and 
three dimensional structures of the invention can relate 
to a broad range of protein kinases. Thus, in preferred 
embodiments, the invention relates to a receptor PTK. 

15 The receptor PTK can be selected form the group 

consisting of FGF-R, PDGF-R, FLK, CCK4 , MET , TRKA, AXL, 
TIE, EPH, RYK, DDR, ROS, RET, LTK, ROR1 , and MUSK. The 
PTK may also exist as a non-receptor PTK. The non- 
receptor PTK can be selected from the group consisting 

20 of SRC, BRK, BTK, CSK, ABL, ZAP70, FES, FAK, JAK, and 
ACK. 

In another aspect, the invention features a 
potential modulator of PTK function identified by 
methods disclosed in the invention. 

25 A preferred embodiment of the invention is that the 

potential modulator of PTK function is an oxindolinone 
or a thiolindolinone of formula I or II disclosed above. 

Another aspect of the invention is a method for 
synthesizing a potential modulator of PTK function or 

30 its pharmaceutical^ acceptable salts, isomers, 

metabolites, esters, amides, or prodrugs by a standard 
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10 



15 



20 



25 



30 



synthetic method known in the art ^nt-h^t- ■ 

me arc. Synthetic procedures 

are discussed below. 

in another aspect, the invention features a method 
°£ identifying a potential modulator of PTK function as 
a modulator of PTK function. The method composes the 
steps of, ,«, administers a potential modulator of PTK 
function to cells; ,b, comparing the level of PTK 
Phosphorylation between cells not administered the 
Potential modulator and ceils administered the potential 
modulator; and ,c, identifying the potential modulator 
as a modulator of PTK function based on the difference 
■ in the level of PTK phosphorylation. 

The term "cells" refers to any type of cells either 
Primary or cultured. Primary cells can be extracted 
drrectly from an organism while cultured cells rapidly 
divide and can be cultured in many successive rounds 
Cells can be grown in a variety of containers including 
but not limited to flasks, dishes, and well plates 

The term "administer- refers to a method of 
delivering a compound to cells. The compound can be 
prepared using a carrier such as dimethyl sulfoxide 
<DMS0) in an aqueous solution. The aqueous solution 
comprising the compound, also termed an "aqueous 
preparation", can be simply mixed into the medium 
bathing the layer of cells or microinjected into the 
cells themselves. The compounds may be administered to 
the cells using a suitable buffered solution. 

The term "suitable buffered solution" refers to an 
aqueous preparation of the compound that comprises a 
salt that can control the pH of the solution at low 
concentrations. Because the salt exists at low 
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concentrations, the salt preferably does not alter the 
function of the cells. 

The term w PTK phosphorylation" refers to the 
presence of phosphate on the PTK. Phosphates on PTKs 
5 can be identified by antibodies that bind them 
specifically with high affinity. 

In another aspect, the invention features a method 
of identifying a potential modulator of PTK function as 
a modulator of PTK function. The method comprises the 
steps of: (a) administering a potential modulator of PTK 
function to cells; (b) comparing the level of cell 
growth between cells not administered the potential 
modulator and cells administered the potential 
modulator; and (c) identifying the potential modulator 
as a modulator of PTK function based on the difference 
in cell growth. 

The term "cell growth" refers to the rate at which 
a group of cells divides. Cell division rates can be 
readily measured by methods utilized by those skilled in 
the art . 

Another aspect of the invention features a method 
of diagnosing a disease by identifying cells harboring a 
PTK with inappropriate activity. The method comprises 
the steps of: (a) administering a modulator of PTK 
function to cells; (b) comparing the rate of cell growth 
between cells not administered the modulator and cells 
administered the modulator; and (c) diagnosing a disease 
by characterizing cells harboring a PTK with 
inappropriate activity from the effect of the modulator 
on the difference in the rate of cell growth. The 
modulator can be identified by the methods of the 
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The term "inappropriate activity" refers to a PTK 
that regulates a step in a signal transduction process 
at a higher or lower rate than normal cells. 
5 Aberrations in the rate of signal transduction can be 
caused by alterations in the stimulation of a receptor 
PTK by a growth factor, alterations in the activity of 
PTK-specific phosphatase, over- expression of a PTK in a 
cell, or mutations in the catalytic region of the PTK 
10 itself. 

The term "signal transduction process" describes 
the steps in a cascade of events where an extracellular 
signal is transmitted into an intracellular signal. 

The term "PTK-specific phosphatase" describes an 
enzyme that dephosphorylates a particular PTK and 
thereby regulates that PTK's activity. 

Another aspect of the invention is a method of 
treating a disease associated with a PTK with 
inappropriate activity in a cellular organism, where the 
method comprises the steps of: (a) administering the 
modulator of PTK function to the organism, where the 
modulator is in an acceptable pharmaceutical 
preparation; and (b) activating or inhibiting the PTK 
function to treat the disease. 

The term "organism" relates to any living being 
comprised of at least one cell. An organism can be as 
simple as one eukaryotic cell or as complex as a mammal. 

The term "administering", in reference to an 
organism, refers to a method of introducing the compound 
to the organism. The compound can be administered when 
the cells or tissues of the organism exist within the 
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organism or outside of the organic 0.11. existrng 
outside .ha organise can be maintained or grown in cell 
cuiture dishes. For cells harbored within the organise 
rcany techniques exist in the art to administer 
5 compounds, including (but not limited to) oral. 

parenteral, dermal, and inaction applications. For 
ceils outside o£ the patient, multiple technics exist 
in the art to administer the compounds, inciuding (but 
not limited to) cell microinjection techniques, 
10 transformation techniques, and carrier techniques. 

The term "pharmaceutically acceptable composition" 
refers to a preparation comprising the modulator of PTK 
activity. The composition is acceptable if it does not 
appreciably cause irritations to the organism 
15 administered the compound. 

Preferred embodiments of the of the invention are 
that the PTK is a receptor PTK selected from the group 
consisting of FGF-R, PDGF-R, FLK-1, CCK4 , MET, TRKA, 

20 Other preferred embodiments of the invention are that 

pttc selected from the group 
the PTK is a non-receptor PTK sexecteu 

f q*C BRK BTK, CSK, ABL, ZAP70 , FES, FAK, 
consisting of SRC, bk*, 

JAK, and ACK. 

The summary of the invention described above is 
25 non- limiting and other features and advantages of the 
invention will be apparent from the following detaxled 
description, and from the claims. 

BEIEE BESCBIEEIQM of THF EISI2BES 
30 FIG. 1 provides a ribbon diagram of the structure 

of FGFRl showing the side chains of tyrosines Tyr-653 
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cataiytic loop , activatiM iQop fcinase « ^ -p. 

regions of the molecule Th^ ^ ■ 
> and c The , , S termini are den oted by N 

ind , T ° P " ^ ^ 33 iS Ordered, 

indicated by a break in ^ u . 

y break in the chain in this region. 

FIG. 2 provides a stereo view of a r - 
shown in the sam . • ° ^ ° f FGFR1 

tenth —ntation as FIG. lf witn every 

enth ammo acid residue marked with a filled ci J le and 
every twentieth amino acid residue iabeied with a 
residue nuraber. 

FIO. 3 provides a structure-based sequence 
2~ h 0f hUman fibrOMaSt — receptor i 

™ ' h OWaSC 3rOWth f a "° r ~~ 2 

(FGFR3 ) ' h f ibr0tlaSt 9r ° Wth ' BCt0r ^« 3 

FGFR ' T ^ 9r ° Wth — « 

elegans homolog (EGL-15i • 

kinase ,IR K) . lnSUl1 " ™„e 

W«8. 4A and 4B provide ribbon diagram of the 

M1M1 l0teS "*> - — inai lobes «„, of FGFRi 
an IRK in whi ch ehe c „ atQms q£ ^ 

;; G tin tha two protei - ha - — 

Uiustrates the side-chain positions of the 
™ lne —phosphorylation sites of FGFR1 on the 
backbone representation of FGFRl 

"OS. SA and SB are a mi „o acid sequence aliments 
of the cataiytic do ra ains of PTKs . including r J p ^ ^ 
non-receptor type P TKs . FIG . 6fl depicts one 
representative m e m ber fro m each of the eighteen 
subf allies of receptor tyrosine kinases. FIG SB 
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depicts one representative member from each of the 
subfamilies of cytoplasmic tyrosine kinases. In FIGS. 
6A and 6B highly conserved residues are boxed. The 
position of the glycine-rich domain, kinase insert, 
5 catalytic loop, and activation loop are indicated. The 
numbering is for human FGF- receptor . 

B RIEF DESCRIPTION OF THE CRYSTAL LQGRAPH I C ATQMTP 
STRUCTURAL COORDINATES 
10 The crystallographic structural coordinates are 

located at the end of the section entitled "Examples" 
and before the claims. Three sets of coordinates can be 
found in the Protein Data Bank under accession names 
1FGK, 1AGW, and 1FGI. The 1FGK coordinates correspond 
15 to those listed in Table 1, the 1AGW coordinates 

correspond to those listed in Table 4, and the 1FGI 
coodinates correspond to those listed in Table 3. The 
1AGW and 1FGI coordinate sets will be publically 
available in March 1998. 
20 Table 1 provides the atomic structure coordinates 

of native FGFR1 crystals of the invention as determined 
by X-ray crystallography; and 

Table 2 provides the atomic structure coordinates 
of FGFRl :AMP-PCP co-crystals of the invention as 
25 determined by X-ray crystallography. 

Table 3 lists crystallographic coordinates defining 
the three dimensional structure of FGF-R1 complexed with 
3- [ (3- (2-carboxyethyl) -4 -methylpyrrol - 5 -yl) methylene] -2- 
indolinone. The columns (from left to right) are 
3 0 descriptions of the atoms by number and type, amino acid 
and number containing the atom, the x coordinate, y 



8NSD0CID: <WO 9807835A2_I_> 



WO 98/07835 



\\ PCT/US97/14885 



40 



coordinate. , coordinate, bond connec 
temperature factor. All of the. 

^'"".r. are well 

Table 4 is a F-i i ^ 
» d efi„in 3 Che thre a e ^ e ° £ ™ ^aphic coordinates 
9 three dimensional structure of pgf-ri 

described in Table 3. 

The present invention is directed to the „ • 
identification of modulators of o r ^ 

i Tr - - « subfami;; 
uat 1 : u r r conditi °- - — 

design and identification * lUtate 

ntitxcatxon of modulators of ptk f Unrr • 
Protein tyrosine kinases (PTKs) • fu ^xon. 

and diverse class of compro.se a large 

1992 « • Schl —inger and Ullrich 

1992, Neuron 9: 383-391. The PTK f, -i • 
into members that Y 13 Subdiv ^ed 

rS that are receptors and those th»i- 
receptors. The PTK . are n ° n " 

~ - : =t r;i ~r,r,r 

factor receptor (FGF-R) PTK whinh • 

. . FrK whlch is a molecule 

-Plicated i„ re9u ia ting an g io 9 enesis a „eH as cell , 
Proliferation and differentiation Givol , 

»«. wis;.. 33 62 . 3369 GlV ° land — 

F GP - R1 C an m ediates cellular functions by its role 
in one or more ceil,n=»- • le 

3191191 transduction processes 
Cellular signal transduction Drn 

— - ....... .. .» jj:r::,:.7i::: r* 
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intracellular signal. 

Receptor PTK mediated signal transduction is 
initiated by binding a specific extracellular ligand, 
followed by receptor dimerization , and subsequent 
5 autophosphorylation of the receptor PTK. The phosphate 
groups are binding sites for intracellular signal 
transduction molecules which leads to the formation of 
protein complexes at the cell membrane. These complexes 
facilitate an appropriate cellular effect (e.g., cell 
10 division, metabolic effects to the extracellular 

microenvironment) in response to the ligand that began 
the cascade of events. 

Receptor PTKs function as binding sites for several 
intracellular proteins. Intracellular PTK binding 
15 proteins are divided into two principal groups: (1) 
those which harbor a catalytic domain; and (2) those 
which lack such a domain but serve as adapters and 
associate with catalytically active molecules. Songyang 
et al w 1993, Cell 72:767-778 . SH2 (src homology) 
20 domains are common adaptors found in proteins which 
directly bind to the receptor PTK. SH2 domains are 
harbored by PTK binding proteins of both groups 
mentioned above. Fantl et al . , 1992 , Cell £9:413-423 ; 
Songyang et al . , 1994, Mol . Cell. Biol. 14:2777-2785); 
25 Songyang et al . , 1993, Cell 72:767-778; and Koch et al . , 
1991, Science 252:668-678 . 

The specificity of the interactions between 
receptor PTKs and the SH2 domains of their binding 
proteins is determined by the amino acid residues 
3 0 immediately surrounding the phosphorylated tyrosine 

residue. Differences in the binding affinities of SH2 
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domains i s corrpi = h^ 

l - urr e lated with m->^ „. 

Wltn c «e observed ri-i ff av . 

Substrate p hoSphorylacion Q( l " erenCeS ^ 

molecules in fh - • downstream 
in the s.gnal transduction process S „ 
1: 1993. Cell 72:767 . 778 C6SS ' S °"^"3 

= surest that the function of each 

aeter^inea not only by its 1 ^ ""^ P ™ *■ 
Ugand avaUabiUty bul lso T " °' 

Particular receptor. Thus PTRs n aCt1 ^ by a 

regulatory role in • Pr ° Vlde " c ° nt ™Uing 

conseguenl " aut h 19nal tra " Sd -"» Processes as a 
q ence of autophosphorylation 

,~uZ:T:: £ : i9nai — 

- ceils. The re "r entlati ° n - = — 

-suit in a „L e ^^iate PTK activity can 

a "ice array of disorders anH H ■ 
disorders, which are describe* v, dlSeasea ' ™ese 

modulators of PTK f ' ~ X * <™ b - 

- methods discLL he:;: 0 " ^ " * 

The present invention also relates „„ 
Polypeptides corresponding to the cataT t " yStalUne 
receptor tyrosine Unases Such t ^ ° f 

include receptors of a class h at WnaSeS 
oross-lin.ea but are J " that «™ ^ —lently 

-need d im er i2 at I! "^T " ""^ li9a " d - 
Wnases. Pre£erahl " ic tyrosine 

are of suffZ eni "\ ^ 

of a three 11 y '° ^ *"* th * d — ination 

enree -dimensional X-ray diffrar-n 

resolution of about 1 s X to """" t0 3 

J- . b A to about 2.5 a Th. 

also relates to methods for prena ■ Mention 

the polypeptides T h Pre ^ng and crystalli 2ing 

ypeptides. Th e polypeptides themeelv*, 

as information derived from t-h ■ tnemSelVes ' *° well 

from their crystal structures can 
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be used to analyze and modify tyrosine kinase activity 
as well as to identify compounds that interact with the 
catalytic domain. 

The polypeptides of the invention are designed on 
5 the basis of the structure of a region in the 

cytoplasmic domain of the receptor tyrosine kinase that 
contains the catalytic domain. By way of illustration, 
FIG. 6A shows the amino acid sequence alignment of the 
catalytic domains of eighteen human receptor tyrosine 

10 kinases; one representative member from each of the 
eighteen subfamilies is shown. FIG. 6B shows the 
alignment for cytoplasmic kinases. The applicants have 
discovered and determined the boundaries of the domain 
required for crystallization of the resulting 

15 polypeptide. Surprisingly, these boundaries differ from 
that required for catalytic activity. For example, 
referring to FIG. 6A, the domain required for catalytic 
activity is generally believed to span about 7 amino 
acid residues upstream of the first glycine (FIG. 6A 

20 residue number 485) of the N-terminal glycine-rich 

region through about 10 residues beyond the C-terminal 
conserved arginine (FIG. 6A, residue number 744) . 
However, the additional sequence upstream of the N- 
terminal glycine-rich region and downstream of the C- 

25 terminal conserved arginine can be required for 

crystallization. In particular, at least about 20 amino 
acid residues <+/- 5 amino acid residues) upstream of 
the first glycine ( i.e. , FIG. 6A, residue number 485) in 
the conserved glycine-rich region of the catalytic 

30 domain, and at least about 17 amino acid residues (+/- 5 
amino acid residues) downstream of the conserved 
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arginine ( i.e. . FIG sa v~ ^ 

FIG. 6A, residue number 744) located at 

the C- terminal boundary of the catalvHn ^ 

ne catalytic domain can be 
required to engineer a polypeptide suitable for 
crystallization. 

In those situations where the resulting po 
centals cysteine residues that interfere with 
crystamsation (fcJu . cysteine residue ^ 
534 in the FGP . R1 seguence ahown , n ^ 

cysteine residues can he substituted with an appropriate 
-no acid that does not readily f orm covaient bonds 
»«h other amino acid residues under crystallization 
conditions.- ^, by substituting the cysceine 
ser or Gly. ^ cysteine loMt><J ^ ^ ^ 

non- S -stranded segment, based on secondary structure 
assrg nments . are good candidates for replacement 

-ample, cysteines located in regions corresponding to 

the glycine-rich- loop, the kinase insert, the 

3 uxtamembrane region or the activation loop are prime 

candidates for replacement. However, substitutions of 

cysteine residues that ■ are conserved among the kinases 

*4U. BO. 6A at positions 725 and 736, are preferably 
avoided. 1 

Blood vessel proliferative disorders refer to 
angiogenic and vasculogenic disorders generally 
resulting in abnormal proliferation of blood vessels 
The formation and spreading of blood vessels play 
important roles in . variety of physiological 
such as embryonic development, corpus luteum formation 
wound healing and organ regeneration. They also play a 
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pivotal role in cancer development. Other examples of 
blood vessel proliferation disorders include arthritis, 
where new capillary blood vessels invade the joint and 
destroy cartilage, and ocular diseases, like diabetic 
5 retinopathy, where new capillaries in the retina invade 
the vitreous, bleed and cause blindness. Conversely, 
disorders related to the shrinkage, contraction or 
closing of blood vessels are implicated in such diseases 
as restenosis. 

10 Fibrotic disorders refer to the abnormal formation 

of extracellular matrix. Examples of fibrotic disorders 
include hepatic cirrhosis and mesangial cell 
proliferative disorders. Hepatic cirrhosis is 
characterized by the increase in extracellular matrix 

15 constituents resulting in the formation of a hepatic 
scar. Hepatic cirrhosis can cause diseases such as 
cirrhosis of the liver. An increased extracellular 
matrix resulting in a hepatic scar can also be caused by 
viral infection such as hepatitis. 

20 Mesangial cell proliferative disorders refer to 

disorders brought about by abnormal proliferation of 
mesangial cells. Mesangial proliferative disorders 
include various human renal diseases, such as 
glomerulonephritis, diabetic nephropathy, malignant 

25 nephrosclerosis, thrombotic microangiopathy syndromes, 
transplant rejection, and glomerulopathies. The PDGF-R 
has been implicated in the maintenance of mesangial cell 
proliferation. Floege et al . , 1993, Kidney 
International 42:47S-54S. 

30 PTKs are directly associated with the cell 

proliferative disorders described above. For example, 
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some members of 

as SOClated Kith ; h h ; ; eceptor ptk *>- «~ 

these receptors * ^ cancer. Some o£ 

- "OP-R <K Uraa be « aJ ; 992 19 "' »«.7.7- ?la , 
*-v a ted b y autocrine , 7 ^« Persistent ly 

— io „ (Akbasa)t and sZLZT PTK over " 

»enroi. Sci . 2 '* AkbaSi "< « • . 1992. .7. 

&^.249-273; Korc et aJ 199-, , 

Jnvest. <?0:1352-13fim J ' ^ CJil1 - 

J - JO - t: J- 3 60) and autocrine i 

«~ and Oo„og„ ue , „„. ^ " • ^ ^^"^ 

supra, ' Akb3Sak ^ A k b asak et aJ 

-m») account ror the moBt « 

For example, EGFR no severe cancers. 

associated with breast « 

oreast, ovarian, oa^H. i 

end bladder cancer. PDGF „ . * 1U " 9 ' PM «eas 

g »ob las to ma lung 

oencer. The ' Ie Z ' ™ ™* 

The receptor PTK c- me t is generallv 
»«h hepetocarcinogenesis and „ h 9e " erall V aesoc.ated 
— AdditiLaU 1 

::::r:;::i::r d - — - 
^ P - a .\ d ;i;: a zr: - — - 

y ' over- expression of hh. 
gene hes been detected in oaf ■ °" met 

diseese Burkitt „ H ° d3kins 
<**>e, Burkitts disease anrt r-v, -> 

The igf t l^Phoma cell line. 

ine IGF- I receptor ptk ,„ . 

r PTK, in addition to being 
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implicated in nutritional support and in type- I I 
diabetes, is also associated with several types of 
cancers. For example, IGF- I has been implicated as an 
autocrine growth stimulator for several tumor types, 
5 e.g. human breast cancer carcinoma cells (Arteaga et 

al . , 1989, J. Clin. Invest. 54:1418-1423) and small lung 
tumor cells (Macauley et al . , 1990, Cancer Res. 50:2511- 
2517) . In addition, IGF-I, integrally involved in the 
normal growth and differentiation of the nervous system, 

10 appears to be an autocrine stimulator of human gliomas. 
Sandberg-Nordqvist et al . , 1993, Cancer Res. 53:2475- 
2478. The importance of the IGF- IR and its modulators 
in cell proliferation is further supported by the fact 
that many cell types in culture (fibroblasts, epithelial 

15 cells, smooth muscle cells, T- lymphocytes , myeloid 

cells, chondrocytes, osteoblasts, the stem cells of the 
bone marrow) are stimulated to grow by IGF-I. Goldring 
and Goldring, 1991, Eukaryotic Gene Expression 2:301- 
326. In a series of recent publications suggest that 

20 IGF-IR plays a central role in the mechanisms of 

transformation and, as such, could be a preferred target 
for therapeutic interventions for a broad spectrum of 
human malignancies. Baserga, 1995, Cancer Res. 55:249- 
252; Baserga, 1994, Cell 73:927-930; Coppola et al . , 

25 1994, Mol. Cell. Biol. 14:4588-4595. 

The association between abnormalities in receptor 
PTKs and disease are not restricted to cancer, however. 
For example, receptor PTKs are associated with metabolic 
diseases like psoriasis, diabetes mellitus, wound 

30 healing, inflammation, and neurodegenerative diseases. 

EGF-R is indicated in corneal and dermal wound healing. 
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DSfeCtS in msulin-R and i GF . IR 

diabetes mellitus A indicated in type-n 

specific rece Ptor PTKS H T^* ^ tWeen 

pcor PTKs and their ther 3 n Q • 
- ~t forth in p 19 ,! raPeUtiC »«="ion. 

Non-receptor PTKs ■ ' " 94 - ^ 7 ="<-33 9 . 
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regeneration and female reproductive processes such as 
follicle development in the corpus luteum during 
ovulation and placental growth after pregnancy. Folkman 
and Shing, 1992, J. Biological Chem. 267:10931-34. 
5 However, many diseases are driven by persistent 

unregulated or inappropriate angiogenesis . For example, 
in arthritis, new capillary blood vessels invade the 
joint and destroy the cartilage. In diabetes, new 
capillaries in the retina invade the vitreous, bleed and 
10 cause blindness. Folkman, 1987, in: Congress of 

Thrombosis and Haernostasis (Verstraete, et. a 1 , eds . ) , 
Leuven University Press, Leuven, pp. 583-596. Ocular 
neovascularization is the most common cause of blindness 
and dominates approximately twenty (20) eye diseases. 
15 Moreover, vasculogenesis and/or angiogenesis can be 

associated with the growth of malignant solid tumors and 
metastasis. A tumor must continuously stimulate the 
growth of new capillary blood vessels for the tumor 
itself to grow. Furthermore, the new blood vessels 
2 0 embedded in a tumor provide a gateway for tumor cells to 
enter the circulation and to metastasize to distant 
sites in the body. Folkman, 1990, J. Natl. Cancer Inst. 
82:4-6; Klagsbrunn and Soker, 1993, Current Biology 
3:699-702; Folkman, 1991, J. Natl., Cancer Inst. 82:4-6; 
25 Weidner et al . , 1991, New Engl . J. Med. 324:1-5. 

Several polypeptides with in vitro endothelial cell 
growth promoting activity have been identified. 
Examples include acidic and basic fibroblastic growth 
factor (aFGF , (3FGF) , vascular endothelial growth factor 
30 (VEGF) and placental growth factor. Unlike aFGF and 

pFGF, VEGF has recently been reported to be an 
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FGFR2, and FGFR3 , have been implicated in a variety of 
human genetic skeletal disorders. Mutations in FGFR1 
and FGFR2 result in the premature fusion of the flat 
bones of the skull and cause the craniosynostosis 
syndromes, such as Apert (FGFR2) (Wilkie et al . , 1994, 
Wat. Genet. 8:269-274), Pfeiffer (FGFR1 and FGFR2 ) 
(Muenke et al., 1994, Nat. Genet. 8:269-274), 
Jackson-Weiss (FGFR2) (Jabs et al . , 1994, Nat. Genet. 
8:275-279) and Crouzon (FGFR2) (Jabs et al . , 1994, Nat. 
Genet. 8:275-279) syndromes. In contrast mutations in 
FGFR3 are implicated in long bone disorders and cause 
several clinically related forms of dwarfism including 
achondroplasia (Shiang et al . , 1994, Cell 78:335-342), 
hypochondroplasia (Bellus et al . , 1995, Nat. Genet. 
.10:357-359) and the neonatal lethal thanatophoric 
dysplasia (Tavormina et al . , 1995, Nat. Genet. 9:321- 
328) . It has been shown that these mutations lead to 
constitutive activation of the tyrosine kinase activity 
of FGFR3 (Webster et al . , 1996, EMBO J. 15:520-527) . 
Furthermore gene-targeting experiments in mice have 
revealed an essential role for FGFR3 in developmental 
bone formation (Deng et al . , 1996, Cell 84:911-921) . 

Another major role proposed for FGFs in vivo is the 
induction of angiogenesis (Folkman and Klagsbrun, 1987, 
Science 236:442) . Therefore, inappropriate expression 
of FGFs or of their receptors or aberrant function of 
the tyrosine kinase activity could contribute to several 
human angiogenic pathologies such as diabetic 
retinopathy, rheumatoid arthritis, atherosclerosis and 
tumor neovascularization (Klagsbrun and Edelman, 1989, 
Arteriosclerosis 9:269) . Moreover, FGFs are thought to 
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II . Modulators of PTK functions a s Therapeutics for 
Disease 

As a consequence of the disorders discussed above, 
scientists in the biomedical community are searching for 
5 modulators of PTK functions that down- regulate signal 

transduction pathways associated with inappropriate PTK 
activity . 

In particular, small molecule modulators of PTK 
functions are sought as some can traverse the cell 

10 membrane and do not hydrolyze in acidic environments. 
Some compounds have already been discovered. For 
example, bis monocyclic, bicyclic or heterocyclic aryl 
compounds (PCT WO 92/20642) , vinylene-azaindole 
derivatives (PCT WO 94/14808) l-cyclopropyl-4-pyridyl- 

15 quinolones (U.S. Patent No. 5,330,992), styryl compounds 
(U.S. Patent No. 5,217,999), styryl -substituted pyridyl 
compounds (U.S. Patent No. 5,302,606), certain 
quinazoline derivatives (EP Application No. 0 566 266 
Al), seleoindoles and selenides (PCT WO 94/03427), 

20 tricyclic polyhydroxylic compounds (PCT WO 92/21660) , 
and benzylphosphonic acid compounds (PCT WO 91/15495) 
are described as PTK inhibitors. 

Although some modulators of PTK function are known, 
many of these are not specific for PTK subfamilies and 

25 will therefore cause multiple side-effects as 
therapeutics . Compounds of the oxindolinone/ 
thiolindolinone family, however, are specific for the 
FGF receptor subfamily (U.S. Patent Application Serial 
No. 08/702,232, filed August 23, 1996, invented by Tang 

30 et al . , entitled "Indolinone Combinatorial Libraries and 
Related Products and Methods for the Treatment of 
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Disease," Attorney Docket No ^ 

compounds of the oxindolinone/thiolindolinone family 'are 
non-hydroly 2 able in acidic conditions and can be highly 
bioavailable. 

The invention provides information regarding the 
specific interactions between a PTK and compounds of the 
oxxndolinone/thiolindoiinone family. Although the use 
of X-ray crystallography has provided three dimensional 
structures of other PTKs. the PTKs in these structures 
are not complexed with PTK subfamily specific 
hydrolysis resistant, highly bioavailable small 
molecmes. Tne x-ray crystallography technigues used in 
the current invention resolve interactions between a PTK 
and compounds in complex with it at the atomic level 
whrch provides detailed information regarding the 
orientation of chemical groups defining an effective 
modulator of PTK function. 

111 • Crystalline Tv^ =1nr r1n nni 

Crystalline PTKs of the invention include native 
crystals, derivative crystals and co-crystals. The 
native crystals of the invention generally co.prise 
substantially pure polypeptides corresponding to the 
tyrosme kinase domain in crystalline form. 

It is to be understood that the crystalline 
tyrosine Kinase domains of the invention are not limited 
to naturally occurring or native tyrosine kinase 
domains. mdeed, the crystals of the invention include 
mutants of native tyrosine kinase domains. Mutants of 
nattve tyrosine kinase do.ains are obtained by repiactng 
at least one amino acid residue in a native tyrosine 
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kinase domain with a different amino acid residue, or by 
adding or deleting amino acid residues within the native 
polypeptide or at the N- or C-terminus of the native 
polypeptide, and have substantially the same three- 
dimensional structure as the native tyrosine kinase 
domain from which the mutant is derived. 

By having substantially the same three-dimensional 
structure is meant having a set of atomic structure 
coordinates that have a root -mean- square deviation of 
less than or equal to about 2A when superimposed with 
the atomic structure coordinates of the nacive tyrosine 
kinase domain from which the mutant is derived when at 
least about 50% to 100% of the Ca atoms of the native 
tyrosine kinase domain are included in the 
superposition . 

Amino acid substitutions, deletions and additions 
which do not significantly interfere with the three- 
dimensional structure of the tyrosine kinase domain will 
depend, in part, on the region of the tyrosine kinase 
domain where the substitution, addition or deletion 
occurs. In highly variable regions of the molecule, 
such as those shown in FIG. 6, non-conservative 
substitutions as well as conservative substitutions may 
be tolerated without significantly disrupting the three- 
dimensional structure of the molecule. In highly 
conserved regions, or regions containing significant 
secondary structure, such as those regions shown in FIG. 
6, conservative amino acid substitutions are preferred. 

Conservative amino acid substitutions are well- 
known in the art, and include substitutions made on the 
basis of similarity in polarity, charge, solubility, 
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domain will be apparent to those of ordinary skill in 
the art . 

It should be noted that the mutants contemplated 
herein need not exhibit PTK activity. Indeed, amino 
acid substitutions, additions or deletions that 
interfere with the kinase activity of the tyrosine 
kinase domain but which do not significantly alter the 
three-dimensional structure of the domain are 
specifically contemplated by the invention. Such 
crystalline polypeptides, or the atomic structure 
coordinates obtained therefrom, can be used to identify 
compounds that bind to the native domain. These 
compounds may affect the activity or the native domain. 

The derivative crystals of the invention generally 
comprise a crystalline tyrosine kinase domain 
polypeptide in covalent association with one or more 
heavy metal atoms. The polypeptide may correspond to a 
native or a mutated tyrosine kinase domain. Heavy metal 
atoms useful for providing derivative crystals include, 
by way of example and not limitation, gold, mercury, 
etc . 

The co-crystals of the invention generally comprise 
a crystalline tyrosine kinase domain polypeptide in 
association with one or more compounds. The association 
may be covalent or non-covalent . Such compounds 
include, but are not limited to, cofactors, substrates, 
substrate analogues, inhibitors, allosteric effectors, 
etc . 
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transcriptional/translational control signals . These 
methods include in vitro recombinant DNA techniques, 
synthetic techniques and in vivo recombination/genetic 
recombination. See, for example, the techniques 
described in Maniatis et al . , 1989 and Ausubel et al . , 
1989 . 

A variety of host -expression vector systems may be 
utilized to express the tyrosine kinase domain coding 
sequence. These include but are not limited to 
microorganisms such as bacteria transformed with 
recombinant bacteriophage DNA, plasmid DNA or cosmid DNA 
expression vectors containing the tyrosine kinase domain 
coding sequence; yeast transformed with recombinant 
yeast expression vectors containing the tyrosine kinase 
domain coding sequence; insect cell systems infected 
with recombinant virus expression vectors ( e.g. . 
baculovirus) containing the tyrosine kinase domain 
coding sequence; plant cell systems infected with 
recombinant virus expression vectors ( e.g. f cauliflower 
mosaic virus, CaMV; tobacco mosaic virus, TMV) or 
transformed with recombinant plasmid expression vectors 
(e T g ■ / Ti plasmid) containing the tyrosine kinase domain 
coding sequence; or animal cell systems. The expression 
elements of these systems vary in their strength and 
specificities . 

Depending on the host/vector system utilized, any 
of a number of suitable transcription and translation 
elements, including constitutive and inducible 
promoters, may be used in the expression vector. For 
example, when cloning in bacterial systems, inducible 
promoters such as pL of bacteriophage X, plac, ptrp, 
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ptac (ptrp-lac hybrid promoter) and the like may be 
used; when cloning in insect cell systems, promoters 
such as the baculovirus polyhedrin promoter may be used • 
when cloning in plant cell systems, promoters derived 
from the genome of plant cells heat shock 

promoters; the promoter for the small subunit of 
RUBISCO; the promoter for the chlorophyll a/b binding 
protein) or from plant viruses the 35s ^ 

promoter of CaMV; the coat protein promoter of TM V) may 
be used; when cloning in mammalian cell systems 
promoters derived from the genome of mammalian cells 
<a^U, metallothionein promoter) or from mammalian 
viruses the adenovirus late promoter; the 

vaccinia virus 7.5K promoter) may be used; when 
generating cell lines that contain multiple copies of 
the tyrosine kinase domain DMA, SV40-, -fiPV- and EBV- 
based vectors may be used with an appropriate selectable 
marker . 

Methods describing methods of DNA manipulation, 
vectors, various types of cells used, methods of 
incorporating the vectors into the cells, expression 
techniques, protein purification and isolation methods, 
and protein concentration methods are disclosed in 
detail with respect to the protein PYK-2 in PCT 
publication WO 96/18738. This publication is 
incorporated herein by reference in its entirety, 
including any drawings. Those skilled in the art will 
appreciate that such descriptions are applicable to the 
present invention and can be easily adapted to it. 
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crystal growth 

Crystals are grown from an aqueous solution 
containing the purified and concentrated polypeptide by 
a variety of techniques . These techniques include 
batch, liquid, bridge, dialysis, vapor diffusion, and 
hanging drop methods. McPherson, 1982, John Wiley, New 
York; McPherson, 1990, Eur. J. Biochem. 189:1-23; 
Webber, 19 91, Adv. Protein Chem. 41:1-36, incorporated 
by reference herein in its entirety, including all 
figures , tables , and drawings . 

Generally, the native crystals of the invention are 
grown by adding precipitants to the concentrated 
solution of the polypeptide corresponding to the PTK 
catalytic domain. The precipitants are added at a 
concentration just below that necessary to precipitate 
the protein. Water is removed by controlled evaporation 
to produce precipitating conditions, which are 
maintained until crystal growth ceases. 

For crystals of the invention, it has been found 
that hanging drops containing about 2.0 fiL* of tyrosine 
kinase domain polypeptide (10 mg/mL in lOmM Tris-HCl, pH 
8.0, 10 mM NaCl and 2 mM dithiothreitol) and 2 . 0 /iL 
reservoir solution (16% w/v polyethylene glycol MW 
10000, 0.3 M (NH 4 ) 2 S0 4 , 5% v/v ethylene glycol or 
glycerol and 100 mM bis-Tris, pH 6.5) suspended over 0.5 
mL reservoir buffer for about 3-4 weeks at 4°C provide 
crystals suitable for high resolution X-ray structure 
determination . 

Those of ordinary skill in the art will recognize 
that the above -described crystallization conditions can 
be varied. Such variations may be used alone or in 
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combination, and include polypeptide solutions 
containing polypeptide concentrations between about 1 
mg/mL and about so mg/ m L, Tris . Hcl concentrations 
between about 10 m and about 200 *. dithiothreitol 
concentrations between about 0 mM and about 20 m pH 

ranges between about 5.5 and about 7 5 ■ and r.. ' • 

* ana reservoir 
solutions containing polyethylene gl ycol concentrations 
between about 10, and about 30, ,„/„, . polyethylene 
olycol molecuiar weights between about 100 0 and about 
^o.ooo, (m , hS04 concentracions faecween aboufc 

about 0.5 M, ethylene glycol or giycerol concentrations 
between about 0% and about 20, (v/v) , bis-Tris 
concentrations between about 10 „ and about 200 «. p „ 
ranges between about 5 . 5 and about 7 . 5 and temperature 
ranges between about „• C and about 25°c. other buffer 
solutrons may be used such as HE PES buffer, so long as 
the desired pH range is maintained. 

Derivative crystals of the invention can be 
obtained by soaking native crystals in mother liguor 
containing salts of heavy me tal atoms. It has b.en 
found that soaking a native crystal in a solution 
containing about 0.1 m to about 5 w thi m erosal 4- 
chloromeruribenzoic acid or KAu(CN) 2 for about 2 hr to 
about 72 hr provides derivative crystals suitable for 
use as isomorphous repiacements in determining the x-ray 
crystal structure of the tyrosine kinase domain 
polypeptide . 

Co-crystals of the invention can be obtained by 
soaking a native crystal in mother liguor containing 
compound that bind the kinase domain, or described 
above, or can be obtained by co-crystalli 2ing the kinase 
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domain polypeptide in the presence of one or more 
binding compounds . 

For co-crystals of tyrosine kinase domain 
polypeptide in co-complex with AMP-PCP, it has been 
found that co-crystallizing the kinase domain 
polypeptide in the presence of AMP-PCP using the above- 
described crystallization conditions for obtaining 
native crystals with a polypeptide solution additionally 
containing 10 mM AMP-PCP and 20 mM MgCl 2 yields co- 
crystals suitable for the high resolution structure 
determination by X-ray crystallography. Of course, 
those having skill in the art will recognize that the 
concentrations of AMP-PCP and MgCl 2 in the polypeptide 
solution can be varied, alone or in combination with the 
variations described above for native crystals. Such 
variations include polypeptide solutions containing AMP- 
PCP concentrations between 0.1 mM and 50 mM and MgCl 2 
concentrations between 0 mM and 50 mM. 

Crystals comprising a polypeptide corresponding to 
a PTK catalytic domain complexed with a compound can be 
grown by one of two methods. In the first method, the 
modulator is added to the aqueous solution containing 
the polypeptide corresponding to the PTK catalytic 
domain before the crystal is grown. In the second 
method, the modulator is soaked into an already existing 
crystal of a polypeptide corresponding to a PTK 
catalytic domain. 
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that constitute the dimer interface are limited to those 
in the fi-sheet of the N- terminal lobe (amino acid 
residues 477, 479, 498, 506, 508 and 496) . The total 
surface area buried in this interface is about 670 A 2 . 
5 The interactions are rather specific. Three hydrophobic 
residues which are partially solvent -exposed in the 
monomer, Val-479, Ile-498 and Val-508, come together 
with their two- fold-related residues to form a compact 
hydrophobic plug. This plug is capped on either side by 

10 a salt bridge between Arg-477 and Glu-496 . In addition, 
two main-chain hydrogen -bonds connect the (5 -sheets of 
the two monomers at the start of $3 (amino acid residues 
506 and 508) . The residues in this dimer interface, or 
their residue character, are generally conserved in the 

15 mammalian FGF receptors, but not in the invertebrate 
homologues . 

The other crystallographically-related dimer buries 
about 1650 A 2 in its interface. In this dimer, the aC 
helices of the two monomers are nearly parallel and 

20 contact each other at their C- terminal ends. Met -534 

and Met-537 are in van der Waals contact with their two- 
fold-related residues. Other hydrophobic contacts 
involve Pro-466 with Ile-648 and Pro-469 with Ile-676 
and Thr-678. In addition, hydrogen bonds (side-chain to 

25 main-chain) are made between Arg-470 and Lys-618 and 

between His -64 9 and Glu-4 64, and there are several water 
molecules that bridge the two monomers through hydrogen 
bonding. 

In the C2-B form of the crystal, the monomers of 
30 this second crystallographically-related dimer are 
shifted slightly with respect to one another (6° 
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After the X-ray diffraction patterns are collected 
from the crystal, the unit cell dimensions and 
orientation in the crystal can be determined. They can 
be determined from the spacing between the diffraction 
emissions as well as the patterns made from these 
emissions. The unit cell dimensions are characterized 
in three dimensions in units of Angstroms (one A = 10" 10 
meters) and by angles at each vertices. The symmetry of 
the unit cell in the crystals is also characterized at 
this stage. The symmetry of the unit cell in the 
crystal simplifies the complexity of the collected data 
by identifying repeating patterns. Application of the 
symmetry and dimensions of the unit cell is described 
below . 

Each diffraction pattern emission is characterized 
as a vector and the data collected at this stage of the 
method determines the amplitude of each vector. The 
phases of the vectors can be determined using multiple 
techniques. In one method, heavy atoms can be soaked 
into a crystal, a method called isomorphous replacement, 
and the phases of the vectors can be determined by using 
these heavy atoms as reference points in the X-ray 
analysis. Otwinowski , 19 91, Daresbury, United Kingdom, 
80-86. The isomorphous replacement method usually 
requires more than one heavy atom derivative. In 
another method, the amplitudes and phases of vectors 
from a crystalline polypeptide with an already 
determined structure can be applied to the amplitudes of 
the vectors from a crystalline polypeptide of unknown 
structure and consequently determine the phases of these 
vectors. This second method is known as molecular 
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degree of overlapping electron density between a 
theoretical and experimental electron density map. 

The R- factor is then minimized by using computer 
programs that refine the theoretical electron density 
map. A computer program such as X-PLOR can be used for 
model refinement by those skilled in the art . Briinger , 
1992, Nature 355:472-475. Refinement may be achieved in 
an iterative process. A first step can entail altering 
the conformation of atoms defined in an electron density 
map. The conformations of the atoms can be altered by 
simulating a rise in temperature which will increase the 
vibrational frequency of the bonds and modify positions 
of atoms in the structure . At a particular point in the 
atomic perturbation process, a force field, which 
typically defines interactions between atoms in terms of 
allowed bond angles and bond lengths, Van der Waals 
interactions, hydrogen bonds, ionic interactions, and 
hydrophobic interactions, can be applied to the system 
of atoms. Favorable interactions may be described in 
terms of free energy and the atoms can be moved over 
many iterations until a free energy minimum is achieved. 
The refinement process can be iterated until the R- 
f actor reaches a minimum value. 

The three dimensional structure of the molecule or 
molecule complex is described by atoms that fit the 
theoreti cal electron density characterized by a minimum 
R-value. A file can then be created for the three 
dimensional structure that defines each atom by 
coordinates in three dimensions. Examples of such 
structural coordinate files are defined in Table 1, 
Table 2, Table 3, and Table 4. 
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one a-helix (aC) . The C-terminal lobe spans amino acid 
residues 568-765 (FIG. 3) and comprises two ft-strands 
(£7, 38) and seven a-helices (aD, aE, aEF, aF-al) . The 
secondary structure nomenclature follows that used for 
IRK (Hubbard et al . , 1994) which in turn is based on the 
assignments for cAPK (Knighton et al . , 1991). FIG. 2 
shows a stereo view of a C w trace of FGFR1 in the same 
orientation as FIG. 1. 

A structure-based sequence alignment of the 
tyrosine kinase domains of human fibroblast growth 
factor receptor 1 (human FGFR1 ; labelled FGFR1 ) , human 
fibroblast growth factor receptors 2, 3 and 4 (labelled 
FGFR2 , FGFR3 and FGFR4 , respectively), a D. melanogaster 
homologue (labelled DFDFR1) , a C elegans homologue 
(labelled EGL-15) and insulin receptor kinase (labelled 
IRK), is shown in FIG. 3. The sequence of FGFR1 , which 
is not shown in FIG. 3 is identical to the sequence of 
FGFR1 except that FGFR1 has the following amino acid 
substitutions and additions: Cys-488 - Ala, Cys-584 - 
Ser, Leu-457 - Val and an additional five N-terminal 
amino acids (Ser-Ala-Ala-Gly-Thr) . The secondary 
structure assignments for FGFR1 and IRK were obtained 
using the Kabsch and Sander algorithm (Kabsch and 
Sander, 1983) as implemented in PROCHECK (Laskowski et 
al . , 1993). In the FGF receptor sequences, a period 
represents sequence identity to FGFR1 . In the IRK 
sequence, residues that are identical to FGFR1 are 
highlighted. A hyphen denotes an insertion. 

The numbers under the EGL-15 sequence represent the 
fractional solvent accessibility (FSA2) of the residue 
in the FGFR1 structure. The FSA ratio is the ratio of 
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the solvent -accessible surface area of a residue in a 
Gly-X-Gly tripeptide compared to that in the FGFRl 
structure, a value of o represents an FSA between 0.00 
and 0.09; 1 represents an FSA between 0.10 and 0.19, 
etc. The higher the value, the more solvent -exposed the 
resxdue. An asterisk or pound sign in the FSA line 
indicates that the residue (asterisk) or side chain 
(pound sign) is not included in the atom model due to 
disorder. The numbers below the FSA line are the FSAs 
for those residues that form part of a dimer interface. 

The amino acid residue numbers for FGFRl, and hence 
FGFRl , and IRK provided in FIG. 3 are used in the 
discussion that follows. Significant differences in the 
N-terminal lobe of FGFRl as compared to IRK are found in 
the loops between 3 strands and in aC Residues frQm ^ 
end of pi through the beginning of (52 (amino acid 
residues 485-490) form the nucleotide-binding loop, 
named because of its role in ATP coordination. This 
residue stretch contains the protein kinase-conserved 
GXGXXG sequence motif, where X is any amino acid. This 
loop is poorly ordered in one FGFRl molecule in the 
asymmetric unit and disordered (J^ f not included in 
the atomic model) in the other FGFRl molecule in the 
asymmetric unit. The loop between pi and 3 3 is 
disordered in both FGFRl molecules comprising the 
asymmetric unit. 

Referring now to FIG. 4A, which provides a ribbon 
diagram of the N-terminal lobes of FGFRl and IRK in 
which the C„ atoms of the (5-sheets have been 
superimposed, it can be seen that in FGFRl aC is longer 
by one helical turn than in IRK and is oriented such 
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that residues Lys-514 and Glu-531, which are conserved 
in protein kinases, form a salt bridge (represented by a 
black line) . While not intending to be bound by theory, 
this salt bridge is believed to be important for proper 
positioning of the conserved lysine side chain, which 
coordinates two phosphate oxygens of ATP. The salt 
bridge is observed in the structures of cAPK (Knighton 
et al . , 1991) and mitogen-activated protein kinase 
(MAPK) (Zhang et al . , 1994). 

Referring now to FIG. 4B, which provides a ribbon 
diagram of the C- terminal lobes of FGFR1 and IRK in 
which the C 0 atoms of the a-helices have been 
superimposed, a significant difference is found in the 
C- terminal helix of FGFR1 when compared to IRK; helix al 
of FGFR1 is longer by seven residues (two helical turns) 
than its counterpart in IRK. The extended length of al 
is presumably important in the biological functioning of 
FGF receptors, since the tyrosine autophosphorylation 
site to which an SH2 domain of PLCy binds is six 
residues C-terminal to this helix. 

The structure of FGFR1 displays an open disposition 
of the N- and C-terminal lobes. Despite having 
different sets of lattice contacts, the two FGFR1 
molecules in the asymmetric unit have only a 2° 
difference in relative lobe orientation. It appears as 
though the stearic interaction between residues in aC 
(Glu-531 and Met-534) with Phe-642 and Gly-643 of the 
protein kinase-conserved DFG sequence at the beginning 
of the activation loop accounts for the open 
conformation of FGFR1 . 

The active site of FGFR1 is characterized by at 
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HRDLAARN) , and is identical to that for IRK and most 
receptor and non-receptor PTKs . 

In addition to the two tyrosine autophosphorylation 
sites in the activation loop (Tyr-653 and Tyr-654) , 
there are four other autophosphorylation sites present 
in the FGFR1 crystals of the invention : one in the 
juxtamembrane region (Tyr-463) , two in the kinase insert 
(Tyr-583 and Tyr-585) and one in the C-terminal lobe 
(Tyr-730) (Mohammadi et al . , 1996). They exhibit 
varying degrees of conservation in mammalian FGF 
receptors: Tyr-463 and Tyr-585 in FGFR1 and 2; Tyr-583 
in FGFR1 , 2 and 3; and Tyr-730 in FGFR 1, 2, 3 and 4 
(FIG. 3) . 

Referring now to FIG. 5, the positions of the 
autophosphorylation sites are mapped onto the FGFR1 
structure. The juxtamembrane site (Tyr-463) and the 
residues N-terminal to it are disordered in one of the 
FGFR1 molecules in the asymmetric unit. In the other 
molecule in the asymmetric unit Tyr-463 is involved in a 
lattice contact. 

The kinase insert region (the region between 
helices aD and aE) contains autophosphorylation sites 
Tyr-583 and Tyr-585 and is disordered in both FGFR1 
molecules in the asymmetric unit of the C2-A form of the 
crystal. In the C2-B form, several lattice contacts 
partially pin down this region in one of the two FGFRI 
molecules in the asymmetric unit, allowing a trace of 
the polypeptide chain to be made. There is no well- 
defined secondary structure for these residues. Tyr- 
730, situated in aH in the C-terminal lobe, is nearly 
buried and the side-chain hydroxyl group makes two 
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direct hydrogen bonds with protein atoms. Lys-514 is 
hydrogen -bonded to oxygens of the 3" and y-phosphates . 
There is no unambiguous electron density that would 
indicate the positions of Mg^ ions. Generally, AMP-PCP 
appears to be coordinated rather loosely to 
unphosphorylated FGFR1 , being bound to the "roof" of the 
cleft rather than being tightly sandwiched between the 
two kinase lobes . 

Structural Differences Between FGF-R and IRK 

Several features distinguish the FGF-receptor 
structure from that of the insulin-receptor tyrosine 
kinase. These distinctions are likely to be important 
in signaling by FGF-receptors , and other monomeric 
receptors that are believed to undergo ligand- induced 
dimerization . 

The most significant difference between the 
structures of FGFR1 and IRK is the conformation of the 
activation loop. In FGFR1 , the activation loop is 
disposed such that the binding site for substrate 
peptides is blocked not by an activation loop tyrosine, 
as in IRK, but by Arg-661 and PTK-invariant Pro-663, 
while the ATP binding site is accessible. This 
represents another molecular mechanism by which a 
receptor PTK may be autoinhibited . The observed 
autoinhibition in FGFR1 would appear to be weaker than 
that in IRK because of fewer specific interactions made 
by residues in the FGFR1 activation loop (manifested in 
the relatively higher B-values) and the accessibility of 
the ATP site. One obvious distinction between the 
insulin and FGF receptor families is that in the former, 
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structure coordinates, as well as models of the three- 
dimensional structures obtained therefrom, can also be 
used to aid the elucidation of solution-based structures 
of native or mutated tyrosine kinase domains, such as 
5 those obtained via NMR . Thus, the crystals and atomic 
structure coordinates of the invention provide a 
convenient means for elucidating the structures and 
functions of receptor and non-receptor tyrosine kinases. 
For purposes of clarity and discussion, the 

10 crystals of the invention will be described by reference 
to specific FGFR1 exemplary crystals. Those skilled in 
the art will appreciate that the principles described 
herein are generally applicable to crystals of the 
tyrosine kinase domain of any cytoplasmic tyrosine 

15 kinase that undergoes ligand- induced dimerization or 

receptor tyrosine kinase, including but not limited to 
the tyrosine kinases of FIG. 6. 

vii. Structure Determination £or PTKs with unknown 
20 Structure Using Structural Coordinates 

Structural coordinates, such as those set forth in 
Table 1, Table 2, Table 3, and Table 4, can be used to 
determine the three dimensional structures of PTKs with 
unknown structure. The methods described below can 

25 apply structural coordinates of a polypeptide with known 
structure to another data set, such as an amino acid 
sequence, X-ray crystal lographic diffraction data, or 
nuclear magnetic resonance (NMR) data. Preferred 
embodiments of the invention relate to determining the 

30 three dimensional structures of PTKs and related 

polypeptides. These include receptor PTKs such as FGF- 
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those skilled in the art is the Homology module in the 
Insight II modeling package distributed by Molecular 
Simulations Inc. 

Alignment of the amino acid sequence is 
accomplished by first placing the computer 
representation of the amino acid sequence of a 
polypeptide with known structure above the amino acid 
sequence of the polypeptide of unknown structure. Amino 
acids in the sequences are then compared and groups of 
amino acids that are homologous (e.g., amino acid side 
chains that are similar in chemical nature - aliphatic, 
aromatic, polar, or charged) are grouped together. This 
method will detect conserved regions of the polypeptides 
and account for amino acid insertions or deletions. 

Once the amino acid sequences of the polypeptides 
with known and unknown structures are aligned, the 
structures of the conserved amino acids in the computer 
representation of the polypeptide with known structure 
are transferred to the corresponding amino acids of the 
polypeptide whose structure is unknown. For example, a 
tyrosine in the amino acid sequence of known structure 
may be replaced by a phenylalanine, the corresponding 
homologous amino acid in the amino acid sequence of 
unknown structure. 

The structures of amino acids located in non- 
conserved regions are to be assigned manually by either 
using standard peptide geometries or molecular 
simulation techniques, such as molecular dynamics. The 
final step in the process is accomplished by refining 
the entire structure using molecular dynamics and/or 
energy minimization. 
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step can be to translate the electron density in three 
dimensions to correctly position the molecules in the 
unit cell. Once the amplitudes and phases of the X-ray 
diffraction data is determined, an f?-factor can be 
5 calculated by comparing electron diffraction maps 

calculated experimentally from the reference data set 
and calculated from the new data set. An R- factor 
between 30-50% indicates that the orientations of the 
atoms in the unit cell are reasonably determined by this 
10 method. A fourth step in the process can be to decrease 
the R-factor to roughly 20% by refining the new electron 
density map using iterative refinement techniques 
described herein and known to those or ordinary skill in 
the art . 

15 

Structures using NMR Data 

Structural coordinates of a polypeptide or 
polypeptide complex derived from X-ray crystallographic 
techniques can be applied towards the elucidation of 

20 three dimensional structures of polypeptides from 

nuclear magnetic resonance (NMR) data. This method is 
used by those skilled in the art. Wuthrich, 1986, John 
Wiley and Sons, New York : 176 -199 ; Pflugrath et a J . , 
1986, J. Molecular Biology 189:383-386 ; Kline et al . , 

25 1986, J. Molecular Biology 185:377-382 . While the 

secondary structure of a polypeptide is often readily 
determined by utilizing two-dimensional NMR data, the 
spatial connections between individual pieces of 
secondary structure are not as readily determinable. 

30 The coordinates defining a three-dimensional structure 
of a polypeptide derived from X-ray crystallographic 
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techniques can guide the NMR spectroscopist to an 
understanding of these spatial interactions between 
secondary structural elements in a polypeptide of 
related structure. 

The knowledge of spatial interactions between 
secondary structural elements can greatly simplify 
Nuclear Overhauser Effect (NOE) data from two- 
dimensional NMR experiments. Additionally, apply ing the 
crystallographic coordinates after the determination of 
secondary structure by NMR techniques only simplifies 
the assignment of NOEs relating to particular amino 
acids in the polypeptide sequence and does not greatly 
bias the NMR analysis of polypeptide structure. 
Conversely, using the crystallographic coordinates to 
simplify NOE data while determining secondary structure 
of the polypeptide would bias the NMR analysis of 
protein structure. 

As the analysis of polypeptide structure by NMR 
methods is a relatively new technique, the use of 
structural coordinates defining a PTK structure will 
most likely be utilized more frequently in the near 
future. as the method progresses, the three dimensional 
structure analysis of polypeptides of the same size as a 
PTK catalytic domain will become more frequent. 
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groups. The computerized design and identification of 
modulators is useful as the computer data bases contain 
more compounds than the chemical libraries, often by an 
order of magnitude. For reviews of structure-based drug 
design and identification see Kuntz et al . , 1994, Acc. 
Chem. Res. 27-. 117; Guida, 19 94, Current Opinion in 
Struc. Biol. 4: 777; Colman, 1994, Current Opinion in 
Struc. Biol. 4: 868. 

The three dimensional structure of a polypeptide 
defined by structural coordinates can be utilized by 
these design methods. The structural coordinates of 
Table 1, Table 2, Table 3, and Table 4 can be utilized 
by this method. In addition, the three dimensional 
structures of receptor and non-receptor PTKs determined 
by the homology, molecular replacement, and NMR 
techniques described herein can also be applied to 
modulator design and identification methods. Thus, the 
structures of receptor PTKs, FGF-R, PDGF-R, FLK, CCK4 , 
MET, TRKA, AXL, TIE, EPH, RYK, DDR, ROS , RET, LTK, R0R1 , 
and MUSK, can be utilized by the methods described 
herein. The structures of non-receptor PTKs, SRC, BRK, 
BTK, CSK, ABL, ZAP70, FES, FAK , JAK, and ACK, can also 
be utilized by the rational modulator design method. 

Design by Searching Molecular Data Bases 

One method of rational modulator design searches 
for modulators by docking the computer representation of 
compounds from a data base of molecules. Publicly 
available data bases include: 



a) ACD from Molecular Designs Limited 
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b) 
c) 
d) 
e) 
f ) 

g) 

h) 
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NCI from National Cancer Institute 

CCDC from Cambridae Crvtst-aii^ 

9 Lr ystal lographic Data Center 

CAST from Chemical Abstract Service 
Derwent from Derwent Information Limited 
Abridge from Maybridge Chemical Company LTD 
Aldrich from Aldrich Chemical Company 
Directory of Natural Products from Chapman & Hall 

One such data base (AC D distributed by Molecular Designs 
I 1 ; 1 " lnf °— " ~> stains, for example, 
200,000 compounds that are synthetically derived or are 
natural products. Methods available to those skilled in 
the art can convert a data set represented in two 
dimensions to one represented in three dimensions 

CONc S o R n meth0dS 91:6 ^ SUCh ™~ ™- as 

CONCOHD from Tripos Associates or DB-Converter from 

Molecular Simulations Limited. 

Multiple methods of structure-based modulator 
design are known to those in the art. Kuntz et al 
"82. J. Mol . sioi . 162: 269; ^ ^ ' 

ell C T Res ' 271 117; Men9 et ai - i992 - - — 

Chem. 13; 505 B ohm 19Q4 t ^ 

g 623 0hm ' 1994 ' J - c °"*>- AWed Moiec. Design 

A computer progrsm „ idely utilized by those sk . ued 
» the art ot racional moduUtor design ^ ^ 
Unrv icy of Cali£ornia ^ pranciaco ^ 

-thod. utili 2e d by this co „ puCer pro3rams 
W» lt are described i„ three applications belo „ ^ 
detaUed info r mati o„ regarding some of these techniques 
can be foun d in the Molecular Simulations User Guide 

1995. ' 
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A typical computer program used for this purpose 
can comprise the following steps: 

(a) remove the existing compound from the protein; 

(b) dock the structure of another compound into 
the active-site using the computer program (such as 
DOCK) or by interactively moving the compound into the 
active- site ; 

(c) characterize the space between the compound 
and the active -site atoms; 

(d) search libraries for molecular fragments which 
(i)can fit into the empty space between the compound and 
the active-site, and (ii) can be linked to the compound; 
and 

(e) link the fragments found above to the compound 
and evaluate the new modified compound. 

Part (c) refers to characterizing the geometry and 
the complementary interactions formed between the atoms 
of the active-site and the compounds. A favorable 
geometric fit is attained when a significant surface 
area is shared between the compound and active- site 
atoms without forming unfavorable steric interactions. 

One skilled in the art would note that the method 
can be performed by skipping parts (d) and (e) and 
screening a data base of many compounds . 

Structure-based design and identification of 
modulators of PTK function can be used in conjunction 
with assay screening. As large computer data base of 
compounds (around 10,000 compounds) can be searched in a 
matter of hours, the computer based method can narrow 
the compounds tested as potential modulators of PTK 
function in ceilular assays. 
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The above descriptions of structure-based modulator 
design are not all encompassing and other methods are 
reported in the literature: 

(1) CAVEAT: Bartlett et al., 1989, in .-chemical and 
Biological Problems in Molecular Recognition", Roberts, 
S.M.; Ley, S.V.; Campbell, M.M. eds . ; Royal Society of 
Chemistry: Cambridge, ppl82-l96. 

(2) FLOG: Miller et al . , 1994, J. C omp . Aided 
Molec. Design 8:153. 

(3) pro Modulator: Clark et al . , 1995, J. Comp. 
Aided Molec. Design 9:13. 

(4) MCSS: Miranker and Karplus, 1991, Proteins: 
Structure, Function, and Genetics 11:29. 

(5) AUTODOCK: Goodsell and Olson, 1990, Proteins: 
Structure, Function, and Genetics 8:195. 

(6) GRID: Goodford, 1985, J. Med. Chem. 28:849. 

Des i gn by Modifyin g Compoun ds -in r-nm P i Py D ^ 

Another way of identifying compounds as potential 
modulators is to modify an existing modulator in the 
polypeptide active-site. For example, the computer 
representation of modulators can be modified within the 
computer representation of a PTK active-site. Detailed 
instructions for this technique can be found in the 
Molecular Simulations User Manual, 1995 in LUDI . The 
computer representation of the modulator is modified by 
the deletion of a chemical group or groups or by the 
addition of a chemical group or groups. 

Upon each modification to the compound, the atoms 
of the modified compound and active-site can be shifted 
in conformation and the distance between the modulator 



WO 98/07835 ( 



PCT/US97/14885 



89 

and the active-site atoms may be scored along with any 
complimentary interactions formed between the two 
molecules. Scoring can be complete when a favorable 
geometric fit and favorable complementary interactions 
are attained. Compounds that have favorable scores are 
potential modulators of PTK function. 

Resign bv Modifying the Structure of Compounds that Bind 
PTKs 

A third method of structure-based modulator design 
is to screen compounds designed by a modulator building 
or modulator searching computer program. Examples of 
these types of programs can be found in the Molecular 
Simulations Package, Catalyst. Descriptions for using 
this program are documented in the Molecular Simulations 
User Guide (1995) . Other computer programs used in this 
application are ISIS/HOST, ISIS/BASE , ISIS/DRAW) from 
Molecular Designs Limited and UNITY from Tripos 
Associates . 

These programs can be operated on the structure of 
a compound that has been removed from the active -site of 
the three dimensional structure of a compound- PTK 
complex. Operating the program on such a compound is 
preferable since it is in a biologically active 
conformation. 

A modulator construction computer program is a 
computer program that may be used to replace computer 
representations of chemical groups in a compound 
complexed with a PTK with groups from a computer data 
base. A modulator searching computer program is a 
computer program that may be used to search computer 
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functional groups. A consequence of this chemical 
diversity is that a potential modulator of PTK function 
may take a chemical form that is not predictable. A 
wide array of organic synthetic techniques exist in the 
art to meet the challenge of constructing these 
potential modulators of PTK function. Many of these 
organic synthetic methods are described in detail in 
standard reference sources utilized by those skilled in 
the art. One example of such a reference is March, 
1994, Advanced Organic Chemistry; Reactions. Mec h anisms, 
and Structure . New York, McGraw Hill. Thus, the 
techniques required to synthesize a potential modulator 
of PTK function identified by computer-based methods are 
readily available to those skilled in the art of organic 
chemical synthesis . 

X. Cellular Assays Measuring the Effect of a ptk 
Modulator in Signal Transduction Pathways 

Cellular assays can be used to test the activity of 
a potential modulator of PTK function as well as 
diagnose a disease associated with inappropriate PTK 
activity. A potential modulator of PTK function can be 
tested for activity in vitro by assays that measure the 
effect of a potential modulator on the 

autophosphorylation of a particular PTK over-expressed 
in a cell line. Thus, a modulator that acts as a potent 
inhibitor of the catalytic domain corresponding to a PTK 
would decrease the amount of autophosphorylation 
catalyzed by that PTK. Potential modulators could also 
be tested for activity in cell growth assays in vitro as 
well as in animal model assays in vivo. 
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in v ivo assays are also useful for testing the 
bioactivity of a potential modulator designed by the 
methods of the invention. 

Materials, methods, and experimental data for these 
assays are fully described in WO 96/40116 published Qn 
December i 9 , 1996 , entitled <>Indolinone ^ 

the Treatment of Disease". This application is 
incorporated herein by reference in its entirety 
^eluding all drawings, f igure s, and tables . 

Methods of administering compounds to organisms as 
therapeutics for disease are fully described in WO 
96/40116 published on December 19, 1996 , entitled 
•indolinone Compounds for the Treatment of Disease" 
Thas application is incorporated herein by reference in 
its entirety, including all drawings, fi gures , and 
tables . 

The examples below are non-limiting and are merely 
representative of various aspects and features of the 
Present invention. The examples provide illustrative 
methods for obtaining crystalline forms of protein 
kinase polypeptides, methods for determining three 
dimensional structures of these protein kinase 
Polypeptides, and methods for identifying modulators of 

protein kinases usino t-he j • 

using the three dimensional structures 

of the protein kinases. 
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EXAMPLE 1 : X-ray Crystalloaraphic Structure 

Peterminatipn of FGFR1 

Polypeptide Synthesis anfl Isolation 

A recombinant baculovirus was engineered to encode 
residues 456-765 of human FGFR1\ A cleavable N-terminal 
histidine tag was incorporated to aid in protein 
purification. Three amino acid substitutions were 
introduced: Cys-488 to Ala, Cys-584 to Ser and Leu-457 
to Val . The two cysteine substitutions were made to 
prevent the formation of disulfide- linked oligomers, 
which occurs for the native protein. The substitution 
Leu-457 to Val introduced a Ncol cloning site near Met- 
456. The codon for Tyr-766 (TAC) was changed to a stop 
codon (TAG) and a HindXJJ-cloning site was generated 
following this stop codon. These substitutions were 
introduced into the full length human cDNA of FGFR1 in 
ml3MPI9 by site-directed mutagenesis according to the 
manufacturer's protocol (Amersham) . 

The resulting construct was digested with Ncol and 
Hindlll and was ligated into appropriately digested 
pBlueBac HistagB (Invitrogen) . Transfection of insect 
cells (Sf9) was performed with the BaculoGold 
transfection system according to the manufacturer's 
protocol (Pharmingen) . Following identification of 
positive plaques, the recombinant baculovirus was 
amplified to high titer (5xl0 7 virus particles/ml) . Sf9 
cells were grown in 175 -cm 2 flasks to a density of 2- 
3xl0 7 per flask and infected with recombinant baculovirus 
with a multiplicity of infection (MOI) of 10. 

After 48 hr, cells were harvested by centrif ugation 
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" 3.000g for 35 min at 4 °c and then lysed 25 
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Crystal "m^ffr 

Purified FGFRi was concentrated to 20 
exchanged into 10 mM Tris-HCl (p H an 

S HC1 (pH 8 -°>< 10 mM NaCl, and 
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2 mM DTT using a Centricon-30 . Crystals were grown at 
4°C by vapor diffusion in hanging drops containing 2.0 
pi of 10 mg/ml protein solution and 2.0 /xl of reservoir 
solution: 16% polyethylene glycol (PEG) 10000, 0.3 M 
(NH 4 ) 3 SO<, 5% ethylene glycol, and 100 mM bis-Tris <pH 
6.5) . 

Crystals of native FGFR1 were soaked in 500 ml 
stabilizing solution [25% PEG 10000, 0.3 M (NH4) 2 SQ 4 , 0.1 
M Bis-Tris (pH 6.5), 5% ethylene glycol] containing 3- 
[ (3- (2-carboxyethyl) -4 -methylpyrrol - 5 -yl ) methylene] -2- 
indolinone (1-5 mM) or 3- [4- (4-formylpiperazine-l-yl ) - 
benzylidenyl] -2- indolinone (1 mM) at 4°C for 24 to 48 
hours. The final soaking concentration of DMSO was 
between 1 to 5%. The crystals cracked at higher 
concentrations of DMSO. 

Co-crystals of FGFR1 with the inhibitors could also 
be obtained by vapor diffusion in hanging drops 
containing 2.0 /xl of 10 mg/ml protein solution and 2.0 
li\ of reservoir solution containing 1 mM 3- [(3- (2- 
carboxyethyl) -4 -methylpyrrol- 5 -yl) methylene] -2- 
indolinone and 3 - [4 - (4 - f ormylpiperazine- 1 -yl- 
) benzylidenyl] - 2-indolinone . 

Co-crystals of FGFR1 complexed with AMP-PCP were 
obtained as described for the creation of native 
crystals, except that the protein solution additionally 
contained 10 mM AMP-PCP and 20 mM MgCl 2 . 

Preparation Of Heavy Atom Derivati ve Crystals 

Heavy atom derivative crystals were obtained by 
soaking FGFR1 native crystals (C2-A form) in a solution 
containing ethylmercurithiosalicylic acid (thimerosal) , 



WO 98/07835 



PCT/US97/14885 



«"<«>, or 4-CUoro m ercurioen Z oic acid ae 

Table i, in/ra , . C aCld ' *s Provided in 

IPH 6.5), and were flash-cooled either 
UqUld — -synchrotron, or I d 

" ltr ° 9en ——"C .rotatin, anode, " 

for native crystals and crystal „ , 

" nucleotide analo 9 A„p.p cp dat C « P "=^ «- 

a *i g a ku RU . 200 rotatin 7' "» «»-ted either on 

^Lacing anode onpr^^ 

- -u *», and equipped „ lth do"™ s " "* 

« «-«XS IIC i„a 9 e plate detecto or a b T ^ 
" the Kational Svnchrotron Ligh t Sou " 
» National Laboratory synchrot B ™ve„ 
collected on Fuii 1 ^"^^ ««.07A> were 

°n Fuji lma ge plates and read with a p ■ • 
scanner, one cryo-cool^ 51 
"» aata sets Z Z f ~ <* 

-re soa^d L a cr Cry °" C °° Ud 

-™a ndl oo«;r;; i r:T;r 9iycoior 

cooled either in l ■ ' ^ WSre flas ^ 

eicner m liquid nitrogen dir^M , 
data) or in a rt. Erectly (synchrotron 

n 3 dry ni trogen stream at -i 75 °C .rot- 
anode data). All A,*. Rotating 

SCALEPACK ot DEI ^ and 

reduction " 1993 ' ^"^ion data 

eduction program," Proceedings of the CCP4 St „ 
Weekend, Sawyer et al ^ , ^ 
«C .areshury ^orat^t . ,7^' ^ ^ 

•0 nucleo'tTd 11 " 1 " CryStalS ™^ th. 

nucleotide analog amp-pcp * m «i 

a f<_f, a molecular reniar.^ 

solution was found initiallv , , re Piacemen t 

initially for the C2 . B crysfcai 



BNSDOCID:<WO_9 8 07835A2_I > 



WO 98/07835 { PCT/US97/14885 

97 

using an IRK search model that consisted of polyalanine 
with the common side chains for residues 993-1263 (FGFR1 
residues 475-754) , excluding residues 1094-1105 (kinase 
insert) and 1153-1170 (activation loop) . With AMORE 
5 (Navaza, 1994, AmoRe : an automated package for molecular 

replacement," Acta Crystallogr. A50: 157-163), using 80% 
of the structure factor amplitudes between 15.0 and 3.5 
A, one of the two molecules in the asymmetric unit was 
located. The correlation coefficient (c.c.) for the 

10 correct 1-molecule solution was 0.23 (versus 0.20 for 
the highest incorrect solution) . This molecule was 
rigid body-refined in X-PLOR (Briinger, 1992, X-PLOR 
(Version 3.1) Manual (New Haven, Conneticut: The Howeard 
Hughes Medical Institute and Department, of Molecular 

15 Biophysics and Biochemistry, Yale Uiversity) ) , first as 
one rigid body unit, then as two units each comprising a 
lobe of the kinase. Rigid body refinement (12.0-3.5 A, 
F>3o) resulted in a relative rotation of the two lobes 
of -10° and an increase of the c.c. from 0.20 to 0.25. 

20 The rigid body-refined molecule was then used as a new 
search model in AMORE, and this time both molecules in 
the asymmetric unit were located. The c.c. for the 
correct 2 -molecule solution was 0.35 (versus 0.27 for 
the highest incorrect solution) . 

25 Multiple cycles of model building and refinement 

against 6.0-2.4 A data resulted in the addition to the 
model of many of the side chains and some of the missing 
polypeptide chain. Model building was performed using 
TOM/FRODO (Jones, 1985, "Diffraction methods for 

3 0 biological macromolecules . Interactive computer 

graphics: FRODO, " Methods in Enzymology 115: 157-171) 
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flattening, histogram matching, and non-crystallographic 
symmetry (NCS) averaging using DM (Cowtan, 1994, 
"Protein Crystallography," CCP4 and ESF-EACBM Newsletter 
(joint) 31 : 34-38) . 

Refinement of the C2-A FGFR1 structure against 6.0- 
2.0 A data proceeded by conjugate-gradient minimization 
and simulated annealing using X-PLOR. Tight NCS 
restraints were imposed until data to 2 . 0 A resolution 
were included in the refinement, at which point the 
restraints were lifted. An overall anisotropic B-value 
was calculated using X-PLOR and applied to the observed 
structure factors, reducing the R- value by -3%. Water 
molecules whose B-values refined to £70 A 2 were omitted 
from the subsequent refinement round. The average B- 
value is 37.5 A 2 for all protein atoms, 35.4 A 2 for 
protein atoms in FLGK-A, 3 9.7 A 2 for protein atoms in 
FLGK-B, and 4 0.2 A 2 for water molecules. The side chains 
for Cys-603 in FLGK-A and FLGK-B and for Met -534 in 
FLGK-B have been modeled in two different conformations. 
Residues that are not included in the atomic model due 
to poor supporting electron density are for FLGK-A: 456- 
463, 486-490, 501-504, 580-591, 763-765; and for FLG-B: 
456-460, 501-504, 578-593, 646-651, 657-659, 762-765. 

The positions of the two AMP-PCP molecules (one per 
FGFR1 molecule) were easily identified in 2F oba(co . complex) - 
Fcaic<rGFim difference Fourier maps. The AMP-PCP molecule 
bound to FLGK-B is less tightly bound and has been 
modeled with an occupancy of 0.5. 

Table A summarizes the X-ray crystallography data 
sets of FGFR1 derivative crystals that were used to 
determine the structures of crystalline FGFR1 and 



WO 98/07835 



PCIYUS97/14885 



100 



10 



15 



20 



25 



30 



crystamne FGFRi :AMP-PCP co-convex of Che invention. 

TABLES 




X-ray source 
Resolution limit (A) 
Number of sites 
Cone. (mM)/time (h) 

R,ym b (%) 

Total observations 
Unique reflections 

Completeness (%) 

Signal (%l>3o) 

R 1$ o e (%) 

Phasing power f 



4.8(1 9. 7) c 
122569 
50771 
97.3(96.3) c 
80.7(50.3) c 



4.5(23.3) c 
91324 
31997 
95.5(93.7) c 
79.6(5 l.7) e 



Thi-P 
RU-200 
2.6 
4 

0.1/24 

5.5 
55456 
42820 d 
95.0 
69.8 



Thi-2 a 
RU-200 
2.8 
7 

0.1/48 

9.8 
59488 
35538" 
96.7 
66.8 



PCMB* 
RU-200 
28 

2 

0.2/2 
6.8 
67988 
18619 
98.0 
84.7 



KAu(CN) ; 
RU-200 
2.8 
2 

5.0/72 

6.8 
45303 
18202 
97.7 
77.6 





17.1 


31.2 


15.4 


15.2 




1.8 


2.0 


1.0 


0.9 




0.55 


0.50 


0.81 


0.84 



0.60 
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For crystals comprising FGFR1 and compounds 1 and 
2, data were collected on a Rigaku RU-200 rotating anode 
(Cu Ka) operating at 5 0 kV and 100 mA and equipped with 
double- focusing mirrors and an R-AXIS IIC image plate 
detector. One cryo-cooled crystal was used for each of 
the data sets. Crystals were soaked in a 
cryo-protectant [25% PEG 10000, 0.3 M (NH,),SO«, 5% 
ethylene glycol, 100 mM bis-Tris (pH 6.5), and 1 mM: 3- 
[ (3- (2-carboxyethyl) -4 -methylpyrrol-5-yl ) methylene] -2- 
indolinone (hereafter referred to as compound 1) or 3- 
[4- (4 -f ormylpiperazine-l-yl- ) benzylidenyl] -2-indolinone 
(hereafter referred to as compound 2) and flash-cooled 
in a dry nitrogen stream at -175°C. Data were processed 
using DENZO and SCALEPACK. Otwinowski, 19 93, 
Proceedings of the CCP4 Study Weekend (Daresbury, United 
Kingdom: SERC Daresbury Laboratory) pp 56-62. 

A summary of the data collection parameters are 
included in the following Table 6: 



IABLE-6 





Resolution 
limit (A) 


Observa- 
tions (N) 


Complete- 
ness (%) 


Redundan- 
cy 


Rsym' (%) 


Signal 
(I> ol) 


compound 
1 


2.5 


93535 


97.6 (96.1) 


2.7 


6.8 

(23.0) 


11.8 


compound 

2 


2.4 


94093 


99.1 (97.9) 


3.3 


6.3 

(32.2) 


11.4 



compound 1 structure: 550 residues, 252 water molecules, 2 compound 1 molecules (4589 atoms) 
compound 2 structure: 550 residues, 248 water molecules, 2 compound 2 molecules (4646 atoms) 



Structure Analyses 

Atomic superpositions were performed with TOSS 
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(Hendrickson, 19701 D 

„ ?9) - Per ^sidue solvent accessible 

surface calr-ni • «<-<-essi£>ie 

■~ b j r - e d ° ne — - surface 

1.4 A Th*. 9 3 Pr ° be "dins Of 

The stereochemical quality of 
was ,„„ lto „, , . y ° f Che atomic model 

omtored using PROCHECK (Laskowski et al 
™CHECK: a computer program to check the t 
quality of protein structures - , 

»»■ - defined in proche ' 93 J of h ^ 283 " 
«- -*1 «~ main-chain ^^^nT" ^ 
^vored Ramachandran regions. Th ere are I 
disaUowed regions, and three residu " 
allowed regions, Ar g 622 . " " 9 — °"=ly ' 

«« in FLGK-A The & ^ "^"» — 

Tabie 7 " £aCt ° r 

7 summarises the x-ray crystallography 
refinement parameters of the structures of 
fgfri and crystal n ructures of crystalline 

crystalline FGFRI :AMP-PCP m ,-, , 
invention. Table 3 summarizes the x 1 " " ^ 

refinement parameters for the fcfrJ 

he FGFR 1 /compound complexes. 



25 



30 



I&BLE2 




FGFR J 550 residues, 252 water moI ecu,es (4589 atoms) 
FGFRl :A MP-PCP : 550residues,238watermolecu,es2AMPPCP m o, . , 




R.m.s.d. 



6.0-2.0 

_FGFRI:AMP-PCP 60 -2.3 



bonds (A) angles (•) B-values> 

«548 21.3(26.2)' 0.008 u TT~ 

26729 20.1(27.5)' 



w »th F obi >2o. 
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b For bonded protein atoms. 

c Value in parentheses is the free R-value (Brunger, 1993) determined from 5% of the data. 

TABLE 8 



Model 


d-spacings (A) 


Reflec- 
tions 


R- 

valuc' (N) 


bonds (A) 


angles (°) 


B- 

values' 
(A : ) 


compound 
1 


6.0-2.4 


42548 


19.7 
(27.0) k 


0.008 


13 


1.6 


compound 

2 


6.0-2.5 


26729 


20.0 
(28.0) k 


0.008 


1.4 


1.7 



'R^ - 100 x S h S, |l s (h) - I(h) 6 | / S h Sj IXh) 

c Value in parentheses is for the highest resolution shell. 

'R-value = 100 x S h ||F D (h)| - |F c (h)|| / S h |F 0 (h)|, where F 0 and F c arc the observed and calculated 
structure factors, respectively (F 0 > 2s). 
'For bonded protein atoms. 

k Value in parentheses is the free R-valuc determined from 5% of the data. 

Atomic Structural CoordinatPR 

Tables 1 and 2 provide the atomic structural 
coordinates of unphosphorylated FGFR1 and 

unphosphorylated FGFR1 : AMP-PCP co-complex, respectively. 
In the Tables, coordinates for both of the FGFR1 
molecules of the dimer comprising the asymmetric unit 
are provided. The amino acid residue numbers coincide 
with those used in FIG. 3. In the first FGFR1 molecule 
of the dimer the residue number is preceded by a 1, 
i -^i , residue number 4 64 of the first FGFR1 molecule of 
the dimer is denoted by M 1464" . Tables 3 and 4 provide 
the atomic structural coordinates of FGFR1 in complex 
with indolinone compounds found to inhibit FGFR1 
function . 
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The foH owing abbreviations ^ 
A**^. -fers to the elemenfc 
are provided. The first x coordxnates 
the element. * C ° 1Umn defines 

"iLJU" refers to amino acid. 

K-and-Z" provide the r*^ 
the element. Cartesran coordinates of 

the 21 ^ 3 thermSl faCC ° r Ch " ™es m ove m ent of 
cne atom around itc- 

u ics atomic center. 

peroenT" ° CC ™' — represents the 

of 0 :r: a ;r atom type — - — 

OCC values range from 0 to l wi , h , . . 
100%. Wlth 1 being 

-aa- or . w . relate to occ 

desrgnating the coordinates of the ato„ I 

^ cne atom when in fho 
first conformation and PRT2 *.„• • 

of the ato m u . ^gnating the coordinates 

the atom when ln the second or alternate 
conformation. 

Structural coordinates for fgprt » u 

- rr -mrrr.- 

subtractions to sets of n, a 

inversion of the raw st ™— ■ 

ot the raw structure coordinates and anv 
tombrnation of the above. "d any 

siiJi addiCi ° n ' StrUOCUral ~-dinates can ne 

t ree dL" *" ~~' ~* 1 * — 

dr.ensronal structures. Therefore.- a measure of 

-Sue set of structure! coordinates is the root ra e ' 
square deviation of the resulting, , ro °t-"ean- 

e resulting structure, structural 
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coordinates that render three dimensional structures 
that deviate from one another by a root -mean- square 
deviation of less than 1.5 A may be viewed as identical. 

EXAMPLE 2: Computer-Based Design of Modulators of 

PTK Function 

Potential modulators of PTK function were designed 
and identified by operating the program Catalyst on the 
structure of 3 - [ (3- (2 - carboxyethyl ) -4-methylpyrrol-5- 
yl) methylene] - 2 - indolinone . The chemical features 
constraining the search model include a hydrogen bond 
donor, a hydrogen bond acceptor, and two hydrophobic 
points of contact. Approximately 40 compounds were 
identified as potential modulators of PTK function using 
this method. 

The compounds identified by the method as potential 
modulators of PTK function were commercially available. 
These compounds were then tested for their ability to 
inhibit the FLK PTK in an enzyme linked immunosorbant 
assay (ELISA) . The method of performing this assay is 
taught in WO 96/40116, entitled "Indolinone Compounds 
for the Treatment of Disease ," published on December 19, 
1996, invented by Tang et al., incorporated by reference 
herein in its entirety, including all figures, drawings, 
and tables. Flk-1 specific antibodies can be prepared 
from the following protocol : 

1. Prepare a Tresyl-Act ivated Agarose/Flk-l-D column 
by incubating 10 ml of Tresyl -Activated Agarose 
with 20 mg of purified GST-Flk-l-D fusion protein 
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in zoo* sodiu. bicarbonate (pH 9 . 6) buffer 
overnight at 4°C. 



2 

3 . 

4 . 

5 . 



6 . 

10 7. 



Wash the column once with pes. 

Bloc, the excess sites on the column with 2 „ 
glycine for 2 hours at 4°C. 
Wash the column with PBS. 

Incubate the column with RaHx,-- 

winn Rabbit anti-Flk-iD 

production bleed for 2 hours at 4°c. 
Wash the column with PBS. 
■l-t. antiserum with „„ mM 

i "oo> . ° £ bU " Sr ' 3amPie C ° tUffer " 

»• the dialed Mtiserum ^ s% 

store at -ao°c in „ n a ii q uotes. 

The m-1 ELISA oan include a 2, 2 -azi„o-bis,3 

which . v abts ) solution, 

*ach can ccnprrse ioo™ citric acid (anhydrous, 2S „ mM 

;;rr 4 - o) ' 05 mg/miABTs ------ 

a at Tc SOlU " 0n " — stored in 
dark at 4 c until ready for use. 

The FLK-l specific antibodies can k 
from Santa Cru 2 Blotch i Purchased 
Cru 2 Biotechnology (Catalog No. SC-504) 

Pour of the forty compounds identified as potential 
oduiators of p TK function _ potent J < * 

unction. The se rao lecules have the fol lowing 
J0 structures: 
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The modulators inhibit the FI.K protein kinase with the 
following IC 50 values: 
5 TABLE 9 



10 



Compound 


FLK kinase 


FLK kinase 


EGFR 


1GF-1R 




ic 50 


ic so 


ic 50 


ic 5n 




(MM) 


(MM) 


(MM) 


(MM) 




compounds 


compounds 








tested at 1 OO^M 


tested at 20^M 






I 


14.8 


14 


>100 


>100 


2 


15.7 


10.6 


>100 


>100 


.> 


21.4 


16.6 


68 


30.9 


4 


22.9 


16.4 


>100 


>I00 



The invention illustratively described herein may 
be practiced in the absence of any element or elements, 
limitation or limitations which is not specifically 
disclosed herein. The terms and expressions which have 
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been employed are used as terms of description and not 
of limitation, and there is no intention that in the use 
of such terms and expressions of excluding any 
equivalents of the features shown and described or 
portions thereof, but it is recognized that various 
modifications are possible within the scope of the 
invention claimed. Thus, it should be understood that 
although the present invention has been specifically 
disclosed by preferred embodiments and optional 
features, modification and variation of the concepts 
herein disclosed may be resorted to by those skilled in 
the art, and that such modifications and variations are 
considered to be within the scope of this invention as 
defined by the appended claims. 

Those references not previously incorporated herein 
by reference, including both patent and non-patent 
references, are expressly incorporated herein by 
reference for all purposes. Other embodiments are 
within the following claims. 
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SEQUENCE LISTSNC 



(1) GENERAL INFORMATION: 



(i) APPLICANT: 



SUGEN , I NCORPORATED 
351 Galveston Drive 
Redwood City, CA 94063 



(ii) TITLE OF INVENTION: 



CRYSTAL STRUCTURES OF A 
PROTEIN TYROSINE KINASE 



(iii) NUMBER OF SEQUENCES: 



(iv) CORRESPONDENCE ADDRESS : 



(A) 
(B) 

(C) 
<D) 
(E) 
(F) 



ADDRESSEE : 
STREET : 

CITY: 
STATE : 
COUNTRY : 
ZIP: 



Lyon & Lyon 

633 West Fifth Street 

Suite 4700 

Los Angeles 

California 

U.S.A. 

90071-2066 



(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: 

(B) COMPUTER: 

(C) OPERATING SYSTEM: 

(D) SOFTWARE : 



3.5" Diskette, 1.44 Mb 
storage 

IBM Compatible 
IBM P.C. DOS 5.0 
FastSEQ for Windows 2 . 0 



(vi) CURRENT APPLICATION DATA: 



(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 



To Be Assigned 
Herewith 



(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 
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fviii) ATTORNEY/ AGENT INFORMATION: 

(A) NAME : 

(B) REGISTRATION NUMBER- 32'^' Rlchard J 
(O REFERENCE/DOCKET NUMBER : 227/088-PCT 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 

(B) TELEFAX- ' 489 "1600 

(C) TELEX • (213) 9 55-0440 

67-3510 

(2) INFORMATION FOR SEQ ID NO:l : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: , ln a . 

(B) TYPE • amin ° aClds 

amino acid 
C STRANDEDNESS: single 
TOPOLOGY: linear 

(ii) MOLECULE TYPE : protein 
(iii) HYPOTHETICAL: N0 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

r -» Ma „ y ^ s „ Glu Tyr olu ^ Mu ^ o ^ 

«- - P„ jj, asp ^ Vil ^ Qiy l ^ ^ l ^ ^ ^ ^ 

<*- «y «. VM vm ^ ». 01u Ma , u Gly l ^ ^ L ^ 

U 45 

Lys Pro Asn Arg Val Thr Lys Val Ala Val I v, m - T 

50 * ai Ala Val L VS Met Leu Lys Ser Asp 

Ala Thr Glu Lys Asp Leu Ser Asp Leu lie Ser Gin «^ m 

65 70 * AJCU - Lie ^er Glu Met Glu Met Met 

75 80 

Mr. »t zu o r g. Hi5 Lys A6 „ a . Ile ss „ Leu ^ oiy wa ^ 

90 95 
Thr Gin Asp Gly Pro Leu Tyr Val Tl» v=i 

100 ^ 31 "! Val Glu ^ Ala Ser Lys Gly 

■ Lu;> 110 

*" Leu 5 Glu m " u Q1 " »• -» -° Pro „„ Leu „. Tyr 

j-^u 125 

cy ^ *,„ P „ s „ Ht5 Pro Glu Glu Mn ^ s ^ ^ ^ ^ 



140 
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Leu.Val Ser Cys Ala Tyr Gin Val Ala Arg Gly Met Glu Tyr Leu Ala 
145 150 155 160 

Ser Lys Lys Cys lie His Arg Asp Leu Ala Ala Arg Asn Val Leu Val 
165 170 175 

Thr Glu Asp Asn Val Met Lys lie Ala Asp Phe Gly Leu Ala Arg Asp 
180 185 190 

lie His His lie Asp Tyr Tyr Lys Lys Thr Thr Asn Gly Arg Leu Pro 
195 200 205 

Val Lys Trp Met Ala Pro Glu Ala Leu Phe Asp Arg lie Tyr Thr His 
210 215 220 

Gin Ser Asp Val Trp Ser Phe Gly Val Leu Leu Trp Glu He Phe Thr 
225 230 235 240 

Leu Gly Gly Ser Pro Tyr Pro Gly Val Pro Val Glu Glu Leu Phe Lys 
245 250 255 

Leu Leu Lys Glu Gly His Arg Met Asp Lys Pro Ser Asn Cys Thr Asn 
260 265 270 

Glu Leu Tyr Met Met Met Arg Asp Cys Trp His Ala Val Pro Ser Gin 
275 280 285 

Arg Pro Thr Phe Lys Gin Leu Val Glu Asp Leu Asp Arg He Val Ala 
290 295 300 

Leu Thr Ser Asn Gin Glu 
305 310 



(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH : 

(B) TYPE: 

(C) STRANDEDNESS : 

(D) TOPOLOGY: 

<ii) MOLECULE TYPE: 
(iii) HYPOTHETICAL: 



315 amino acids 
amino acid 
single 
linear 

protein 

NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 

Ser Ala Ala Gly Thr Met Val Ala Gly Val Ser Glu Tyr Glu Leu Pro 
15 10 15 

Glu Asp Pro Arg Trp Glu Leu Pro Arg Asp Arg Leu Val Leu Gly Lys 
20 25 30 
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Pr ° Leu ? G1 " 01y A1 * - G1 " v " - - - - - - 



45 



«y - «p ^ *. P ^ ^ As „ Arg Vil Ihr Lys yai u> v ^ ^ 

6 0 

Met Leu Lys Ser AS p Ala Thr- r^i , 

?0 ^ ASP Leu Ser Asp Leu lie S 



75 ;f 

80 



«. - «. y Lya „ et IU aiy ^ his Lys Asn iie a< ^ 

- - «* »: ^ 01 „ Asp tli Pro ^ Tyr vai iie - 

- «. s Gly As „ ^ s „. Tyr Leu Gin ^ ™ ^ ^ 

- s Leu 01u Tyr s „ ^ asn pro s>r His ^ - ^ ^ 

- - se, , ys „ s val ser ^ s Ma ^ w ui ^ ^ 
M " Glu Iyi " u a s - - - <*• - -X ... 1° 

175 

Arg Asn Val Leu Val Thr n„ ft 

Glu Asp ash Val Met Lys Ile ^ Asp 

190 

OlV «*u Ma Arg Asp Jle H . s 

195 200 7 LyS Thr T ^ 

205 

- S j - VI1 Trp Met Ai , pro oiu aia L>u 

220 

S n. ^ ^ His ^ s „ ^ w ^ p ^ ^ ^ ^ ^ 
«- u. ^ _ „ y Qly s „ ^ ™ My ^ ^ - 

«■ «- u. jj. ^ „ u Leu Lys n ; His Arg Met asp ™ ^ 

270 

~ g. ^ _ 01u to ^ Met Met mt ar9 ^ cys ^ ^ 

- VU P,o s „ Gln arg Thr _ ^ Mn uu ~ ^ ^ 

300 

JSP Arg Ile Val AIa Leu 

310 315 
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(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 351 amino acids 

(B) TYPE: amino acid 
(C> STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: NO 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

Met Arg Gly Ser His His His His His His Gly Met Ala Ser Met Thr 
1 $ 10 15 

Gly Gly Gin Gin Met Gly Arg Asp Leu Tyr Asp Asp Asp Asp Lys Asp 
20 25 30 

Pro Ser Ser Arg Ser Ala Ala Gly Thr Met Val Ala Gly Val Ser Glu 
35 40 45 

Tyr Glu Leu Pro Glu Asp Pro Arg Trp Glu Leu Pro Arg Asp Arg Leu 
50 55 60 

Val Leu Gly Lys Pro Leu Gly Glu Gly Ala Phe Gly Gin Val Val Leu 
65 70 75 80 

Ala Glu Ala lie Gly Leu Asp Lys Asp Lys Pro Asn Arg Val Thr Lys 
85 90 95 

Val Ala Val Lys Met Leu Lys Ser Asp Ala Thr Glu Lys Asp Leu Ser 
100 105 no 

Asp Leu He Ser Glu Met Glu Met Met Lys Met He Gly Lys His Lys 
115 120 125 

Asn He He Asn Leu Leu Gly Ala Cys Thr Gin Asp Gly Pro Leu Tyr 
130 135 140 

Val He Val Glu Tyr Ala Ser Lys Gly Asn Leu Arg Glu Tyr Leu Gin 
145 150 155 160 

Ala Arg Arg Pro Pro Gly Leu Glu Tyr Ser Tyr Asn Pro Ser His Asn 
165 170 175 

Pro Glu Glu Gin Leu Ser Ser Lys Asp Leu Val Ser Cys Ala Tyr Gin 
180 185 190 

Val Ala Arg Gly Met Glu Tyr Leu Ala Ser Lys Lys Cys He His Arg 
195 200 205 
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Asp Leu Ala Ala Arg Asn Val Leu Val Thr Glu Asp Asn Val Met Lys 
210 215 oon 



220 

Ala Asp Phe Gly Leu Ala Arq Asd lie Hie 
225 230 

Lys Lys Thr Thr Asn Gly Arg Leu Pro Val Lys Tr P Met Ala Pro Glu 



lie Ala Asp Phe Gly Leu Ala Arg Asp lie His His lie Asp Tyr Tyr 

235 240 



245 250 



255 



Ala Leu Phe Asp Arg lie Tyr Thr His Gin Ser Asp Val Trp Ser Phe 
260 265 2?0 

Gly Val Leu Leu Trp Glu lie Phe Thr Leu Gly Gly Ser Pro Tyr Pro 
275 280 

Gly val Pro Val Glu Glu Leu Phe Lys Leu Leu Lys Glu Gly His Arg 
290 295 



300 



Met Asp Lys Pro Ser Asn Cys Thr Asn Glu Leu Tyr Met Met Met Arg 
5 310 315 320 

Asp Cys Trp His Ala Val Pro Ser Gin Arg Pro Thr Phe Lys Gin Leu 
325 330 335 

Val Glu Asp Leu Asp Arg He Val Ala Leu Thr Ser Asn Gin Glu 
340 345 35Q 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 933 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
ATGCTAGCAG GGGTCTCTGA GTATGAGCTT CCCGAAGACC CTCGCTGGGA GCTGCCTCGG 
GACAGACTGG TCTTAGGCAA ACCCCTGGGA GAGGGCTGCT TTGGGCAGGT GGTGTTGGCA 
GAGGCTATCG GGCTGGACAA GGACAAACCC AACCGTGTGA CCAAAGTGGC TGTGAAGATG 
TTGAAGTCGG ACGCAACAGA GAAAGACTTG TCAGACCTGA TCTCAGAAAT GGAGATGATG 
AAGATGATCG GGAAGCATAA GAATATCATC AACCTGCTGG GGGCCTGCAC GCAGGATGGT 
CCCTTGTATG TCATCGTGGA GTATGCCTCC AAGGGCAACC TGCGGGAGTA CCTGCAGGCC 
CGGAGGCCCC CAGGGCTGGA AT AC TGCT AC AACCCCAGCC ACAACCCAGA GGAGCAGCTC 



60 
120 
180 
240 
300 
360 
420 
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TCCTCCAAGG ACCTGGTGTC CTGCGCCTAC CAGGTGGCCC GAGGCATGGA GTATCTGGCC 4 80 

TCCAAGAAGT GCATACACCG AGACCTGGCA GCCAGGAATG TCCTGGTGAC AGAGGACAAT 54 0 

GTGATGAAGA TAGCAGACTT TGGCCTCGCA CGGGACATTC ACCACATCGA CTACTATAAA 6 00 

AAGACAACCA ACGGCCGACT GCCTGTGAAG TGGATGGCAC CCGAGGCATT ATTTGACCGG 66 0 

ATCTACACCC AC C AGAGTG A TGTGTGGTCT TTCGGGGTGC TCCTGTGGGA GATCTTCACT 72 0 

CTGGGCGGCT CCCCATACCC CGGTGTGCCT GTGGAGGAAC TTTTCAAGCT GCTGAAGGAG 780 

GGTCACCGCA TGGACAAGCC CAGTAACTGC ACCAACG AG C TGTACATGAT GATGCGGGAC 84 0 

TGCTGGCATG CAGTGCCCTC ACAGAGACCC ACCTTCAAGC AGCTGGTGGA AGAC CTGGAC 900 

CGCATCGTGG CCTTGACCTC CAACCAGGAG TAG 933 

(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1056 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 

ATGCGGGGTT CTCATCATCA TCATCATCAT GGTATGGCTA GCATGACTGG TGGACAGCAA 60 

ATGGGTCGGG ATCTGTACGA CGATGACGAT AAGGATCCGA GCTCGAGATC TGCAGCTGGT 120 

ACCATGGTAG CAGGGGTCTC TGAGTATGAG CTTCCCGAAG ACCCTCGCTG GGAGCTGCCT 180 

CGGGACAGAC TGGTCTTAGG CAAACCCCTG GGAGAGGGCG CCTTTGGGCA GGTGGTGTTG 24 0 

GCAGAGGCTA TCGGGCTGGA CAAGGACAAA CCCAACCGTG TGACCAAAGT GGCTGTGAAG 3 00 

ATGTTGAAGT CGGACGCAAC AGAGAAAGAC TTGTCAGACC TGATCTCAGA AATGGAGATG 360 

ATGAAGATGA TCGGGAAGCA TAAGAATATC ATCAACCTGC TGGGGGCCTG CACGCAGGAT 420 

GGTCCCTTGT ATGTCATCGT GGAGTATGCC TCCAAGGGCA ACCTGCGGGA GTACCTGCAG 480 

GCCCGGAGGC CCCCAGGGCT GGAATACTCC TACAACCCCA GCCACAACCC AGAGGAGCAG 54 0 

CTCTCCTCCA AGGACCTGGT GTCCTGCGCC TACCAGGTGG CCCGAGGCAT GGAGTATCTG 600 

GCCTCCAAGA AGTGCATACA CCGAGACCTG GCAGCCAGGA ATGTCCTGGT GACAGAGGAC 6 60 
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AATGTGATGA AGATAGCAGA 

"-OAGAA CCAACGGGGG ACrGCCTGTG AAG™ cacccgaggc 
CGGATCTACA CCCACCAGAG TGATGTGTGG TCTTTCGGGG T G„G GGAGATC^ 
ACTCTGGGCG GCTCCCCATA GGGCGG^ cg^gg aac™ gg TCCT g mo 
— CACC GCATGGACAA GCCGAGTAAC TGGAGCAACG AGCTGTACAT GATGATGCGG 
GACTGCTGGC ATGCAGTGCC CTCACAGAGA CCCACC^CA AGCAGCTGGT GGAAGACCTG 
GACCGCATCG TGGCCTTGAG CTGGAACGAG GAGTAG 



720 
780 
840 
900 
960 
1020 
1056 
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TABLE i 



Atom 
No. 


Atom 
Tvoe 


A . A 


A . A 
No . 


X 




y 
i 




7 

La 


occ 


ts 




ATOM 


1 


XT 
IN 


/~f T TT 


T >l C A 

14b 4 


-13 


. 639 


16 


. 975 


8 


. 571 


1 


.00 


54 


. 29 


ATOM 


3 


l_A 


LsLiU 


14 64 


-12 


.479 


17 


. 105 


7 


. 695 


1 


. 00 


52 


. 62 


ATOM 


4 


l_±5 




T A C A 
1464 


-11 


.400 


17 


. 974 


8 


. 349 


1 


.00 


54 


. 64 


ATOM 


5 


(_ 


f-*T TT 


1464 


-11 


. 914 


15 


. 738 


7 


.319 


1 


.00 


49 


. 74 


ATOM 


6 


u 


GLU 


1464 


-11 


. 845 


15 


.407 


6 


. 136 


1 


.00 


52 


. 04 


ATOM 


7 


TvT 

N 


LEU 


1465 


-11 


. 562 


14 


. 925 


8 


.310 


1 


.00 


44 


. 95 


ATOM 


9 


<~A 


T TTTT 


T A CZ C 

14 b b 


-11 


.018 


13 


► 599 


8 


. 037 


1 


.00 


41 


. 04 


ATOM 


10 


nn 

L.B 


T PTT 


14b b 


-10 


.236 


13 


. 066 
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# 



ATOM 4103 CG ASP 735 
ATOM 4104 OD1 ASP 735 
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C 
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O 
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ATOM 
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N 


CYS 
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ATOM 4122 NE1 TRP 737 
ATOM 4123 CZ2 TRP 737 



ATOM 4124 CZ3 TRP 737 

ATOM 4125 CH2 TRP 73 7 

ATOM 4126 C TRP 73 7 

ATOM 4127 O TRP 737 

ATOM 4128 N HIS 738 

ATOM 4129 CA HIS 738 

ATOM 4130 CB HIS 738 

ATOM 4131 CG HIS 738 

ATOM 4132 CD2 HIS 738 

ATOM 4133 ND1 HIS 738 

ATOM 4134 CE1 HIS 73 8 

ATOM 4135 NE2 HIS 738 
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CD 


PRO 


741 


ATOM 


4152 


CA 


PRO 


741 


ATOM 


4153 


CB 


PRO 


741 


ATOM 


4154 


CG 


PRO 


741 



45.690 -1. 175 32 .604 1 . 00 47.78 

46.116 -2.355 32.524 1 . 00 47 92 

44.733 -0.803 33.339 1 . 00 46.'75 

46.339 2.161 30.881 1.00 37 68 

I?-"! 3 178 31 "9 1.00 36.83 

46 996 1.967 29.734 1.00 35.06 
47.858 2.979 29.140 1 . 00 32.91 
48.499 2.469 27.848 1 . 00 27 65 
49.631 1.067 27.989 1 . 00 31.' 0 8 

47 054 4.232 28.828 1.00 35.28 



737 


45.742 


737 


44 . 885 


737 


43 . 890 


737 


44 .535 


737 


43 . 976 


737 


44 . 932 


737 


42 . 763 


737 


45 . 766 


737 


46. Oil 


737 


44 . 708 


737 


42.549 


73 7 


43 . 518 


737 


44 . 159 


737 


43 .163 


738 


44 .685 


738 


44 . 059 


738 


44 .698 


738 


43 . 970 


738 


43 . 685 



1-956 23.820 1.00 38.42 



5.643 3C.743 1 . 00 43.61 



------ -l.UL/ l^.ij 

43.401 4.961 35.252 1.00 52 48 

42-798 5.541 36.275 1.00 55 70 

42.955 6.848 36.159 1 . 00 51.42 

44.202 7.714 31.969 1.00 44 15 

45.294 8.223 31.787 1. 00 43.*14 

43.115 8.428 32.272 1.00 45 42 

43.141 9.895 32.318 1.00 47 29 

41.792 10.426 32.752 1.00 49*75 

44.240 10.454 33.223 1.00 48 73 

44.921 11.415 32.868 1.00 49.32 

44.331 9.893 34.425 1.00 50 51 

45.332 10.262 35.424 1.00 5132 
44.861 9. 88 o 36.842 1.00 52.' 2 9 
45.905 10.254 37.869 1.00 53 73 
43.551 10.575 37.152 1.00 53 54 
46.656 9.535 35.121 1.00 51.' 06 
46.780 8.320 35.348 1.00 50 81 
47.670 10.280 34.657 1 . 00 50 12 
47.595 11.738 34.454 1.00 50 19 
49.003 9.775 34.294 1.00 Sl.'lO 
49.790 11.060 34.024 1.00 50 35 
48.731 11.978 33.492 1.00 5 0 "l3 
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C LYS 
O LYS 
N GLN 

4218 CA GLN 

4219 CB GLN 

4220 CG GLN 

4221 CD GLN 

4222 OE1 GLN 
4 223 NE2 GLN 

4224 C GLN 

4225 O GLN 

4226 N LEU 
4 22 7 CA LEU 
4228 CB LEU 
4 22 9 CG LEU 

4230 CD1 LEU 

4231 CD2 LEU 
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O 
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750 
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750 
751 
751 
751 
751 
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751 
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752 
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752 
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753 
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753 
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58 . 008 
58 .328 
58 . 920 
59. 529 
60.200 
60.917 
61.353 
62.135 
57.813 
58.025 
56.622 
55 .454 
54.254 
54 . 378 
53.203 
53.392 
51.988 
55. 049 
54.964 
54 . 810 
54 .409 

54 .358 
53 .369 
53.745 
51.941 

55 .369 
54 . 934 
56.673 
57.656 
59.129 
60. 092 
59.415 
57 .428 
57.492 
57 . 109 
56 .854 
56. 779 
58.093 
58.215 
58.554 
58.021 
55.594 
55.646 
54.490 
53.232 
52.119 
51.579 
51.440 
51.281 
53.371 
53 . 001 
53.903 
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-2. 047 
-1.515 
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-2.141 
-2.503 
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-2 . 
-2 . 
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727 



-0. 
0 . 



-0. 951 
-3.588 
-4 . 810 
-2.817 
-3.355 
-2.241 
-1.091 
0 . 037 
-1 . 578 
-4.437 
-5.449 
-4 .212 
-5.217 
-4.724 
-5.836 
-3 .535 
-6 .493 
-7.599 
-6.338 
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-6.448 
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-7.228 
-8.256 
-9.464 
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-8 .128 
-7 
-6 
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707 
-7.589 
-5.513 
-8.771 
-9. 930 
-8 . 009 



31 . 156 
32 . 972 
33 . 727 
35. 026 
35.838 
37 . 064 
37. 973 
39.127 
34 . 008 

33 . 848 

34 . 383 
34 .669 
35.134 
36 . 500 
36.797 
37.123 
36.665 
33 .397 
33 .369 
32.340 
31 . 033 
29. 984 
30 . 177 
29.217 
2 9. 934 
30. 557 
30 . 014 
30 . 721 
30.312 
30.485 
30 . 120 
29.598 
31.131 
30.594 
32 .414 
33 . 266 
34.743 
35.212 
36 . 707 
37.136 
37.452 
32 .809 
32 . 551 
32 .640 
32.193 
32.118 
33 .467 
34.330 
33.659 
30.837 
30 . 649 
29.889 



1.00 31.47 
1.00 38.86 
1.00 39. 79 
1 * 00 43.68 
1 • 00 48.11 
1.00 51.33 
1.00 51.17 
1.00 56.55 
1 • 00 41 . 14 
1 . 00 38 .24 
1. 00 41.20 
1.00 40.49 
1 ■ 00 45 . 70 
1.00 50.61 
1.00 55.26 
1-00 58.00 
1.00 59.25 
1 . 00 37.42 
1.00 36.00 
1.00 36.76 
1 - 00 35.39 
1-00 30.97 
1.00 27.36 
1.00 29.15 
1-00 29.22 
1-00 35.16 



1 
1 
1 
1 
1 
1 
1 
1 
1 . 
1. 
1 . 
1 . 
1. 
1. 
1. 
1 
1 
1 
1 
1 
1. 



1 . 00 34 . 45 
1.00 38.76 
1.00 38.69 
1.00 33.81 
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What is claimed is: 

1. A crystalline form of a polypeptide 
corresponding to the catalytic domain of a protein 
tyrosine kinase . 

2. The crystalline form of claim 1, wherein said 
protein tyrosine kinase is a receptor protein tyrosine 
kinase . 

3. The crystalline form of claim 2, wherein said 
receptor protein tyrosine kinase is selected from the 
group consisting of PDGF-R, FLK, CCK4 , MET, TRKA, AXL, 
TIE, EPH, RYK, DDR, ROS , RET, LTK, ROR1 , and MUSK. 

4. The crystalline form of claim 1, wherein said 
protein tyrosine kinase is a non-receptor protein 
tyrosine kinase . 

5. The crystalline form of claim 4, wherein said 
non-receptor protein tyrosine kinase is selected from a 
group consisting of SRC, BRK, BTK, CSK, ABL, ZAP70, FES, 
FAK, JAK, and ACK. 

6. The crystalline form of claim 1, comprising 
one or more heavy metal atoms. 



7. The crystalline form of claim 1, wherein said 
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protein tyrosine kinase is FGFR. 

8- The crystalline form of claim 7, wherein said 
FGFR is FGFR1. 



9 The crystalline form of claim 8, defined by 



atomic structural coordinates set forth in Tab! 



e l 



, 10 ■ CryStalli - *o» of ciain, 7. co.pri.ing at 

least one compound. 



11- The crystalline form of claim 10, wherein said 
compound is a nucleotide analog. 



12. The crystalline form of claim li, wherein said 
nucleotide analog is AMP-PCP. 



13. The crystalline form of claim 12, defined by 
atomic structural coordinates set forth in Table 2. 

14- The crystalline form of claim 10, wherein said 
compound is an indolinone compound. 

15. The crystalline form of claim 14, wherein said 
indolinone compound has a structure set forth in formula 
I or li: 
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or a pharmaceutical]^ acceptable salt, isomer, 
metabolite, ester, amide, or prodrug thereof, wherein 

(a) A lt A 2 , A 3 , and A 4 are independently carbon or 
nitrogen; 

(b) Rj is hydrogen or alkyl; 

(c) R 2 is oxygen in the case of an oxindolinone or 
sulfur in the case of a thiolindolinone ; 

(d) R 3 is hydrogen; 

(e) R 4/ R S/ R 6 , and R 7 are optionally present and are 
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either (i) independently selected from the group 
consisting of hydrogen, alkyl, alkoxy, aryl, aryloxy, 
alkaryl, alkaryloxy, halogen, trihalomethyl , s(o)R 
SO,NRR-, S0 3 R, SR, N0 2 , NRR 1 , OH, CN, C(0,R, 0C(0) R '' 
NHC(0)R, (CH 2 > n C0 2 R, and CONRR ■ or (ii) any two 
R*. Rs, Ro, and R 7 taken together form a fused ring wich 
the aryl portion of the oxindole-based portion of the 
indolinone; 

< f > R 2 ', R 3 ', R<', R 5 ., and R 6 ■ are each 
independently selected from the group consisting of 
hydrogen, alkyl, alkoxy, aryl, aryloxy, alkaryl, 
alkaryloxy, halogen, trihalomethyl, S(0)R, so ? NRR ■ , S 0 3 R 
SR. N0 2 , NRR-, OH, CN, C(0)R, OC(0)R, NHC(0)R, (CH^OD*'. 
and CONRR 

(g) n is 0, l, 2, or 3; 

(h) R is hydrogen, alkyl or aryl; 

(i) R' is hydrogen, alkyl or aryl; and 

(j) A is a five membered heteroaryl ring selected 
from the group consisting of thiophene, pyrrole 
pyrazole, imidazole, 1 , 2 , 3 -triazole, 1 , 2 , 4 -triazole 
oxazole, isoxazole, thiazole, isothiazole, furan, 
oxadiazole, 1 , 2 , 4 -oxadiazole, 1 , 2 , 5 -oxadiazole , 1 3 4 _ 
oxadiazole, l , 2 , 3 , 4 -oxatriazole , 1, 2, 3, 5-oxatriazole 

1.2.3- thiadiazole, 1 , 2 , 4 - thiadiazole, 1, 2, 5-thiadiazole, 

1.3.4- thiadiazole, 1 , 2 , 3 , 4 - thiatriazole , 1,2,3,5- 
thiatriazole, and tetrazole, optionally substituted at 
one or more positions with alkyl, alkoxy, aryl, aryloxy< 
alkaryl, alkaryloxy, halogen, trihalomethyl, s(0)R 
S0 2 NRR-, S0 3 R, SR, N 0 2 , NRR' , OH, CN, C(0)R, 0C(0)R,' 
NHC(Q)R, (CH 2 ) n C0 2 R or CONRR' . 
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16. The crystalline form of claim 15, wherein said 
indolinone compound is 3- [ {3- (2 -carboxyethyl ) -4- 
methylpyrrol - 5 -yl) methylene] -2 - indolinone . 

17. The crystalline form of claim 15, wherein said 
indolinone compound is 3 - [4 - (4-f ormylpiperazine-1- 

yl) benzylidenyl] -2 -indolinone . 

18. The crystalline form of claim 16, defined by 
the atomic structural coordinates of Table 3 . 

19. The crystalline form of claim 17 , defined by 
the atomic structural coordinates of Table 4. 

20. The crystalline form of claim 1, having 
monoclinic unit cells. 

21. The crystalline form of claim 20, wherein said 
monoclinic unit cells have dimensions of about a=208.3 
A, b=57.8 A, c=65.5 A and 3=107.2°. 

22. The crystalline form of claim 20, wherein said 
monoclinic unit cells have dimensions of about a=211.6 
A, b=51.3 A, c=66.1 A and 3=107.7°. 

23. The crystalline form of claim 10, comprising 
one or more heavy metal atoms. 



24. A polypeptide corresponding to the catalytic 
domain of a protein tyrosine kinase, containing at least 
about 20 amino acid residues upstream of the first 
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glycine in the conserved glycine-rich region of the 
catalytic domain, and at least about 17 amino acid 
residues downstream of the conserved arginine located at 
the c- terminal boundary of the catalytic domain. 

25. The polypeptide of claim 24, wherein said 
protein tyrosine kinase is a receptor protein tyrosine 
kinase. 



26. The polypeptide of claim 24, wherein said 
protein tyrosine kinase is a non-receptor protein 
tyrosine kinase. 

27. The polypeptide of claim 25, wherein said 
receptor tyrosine kinase is selected from the group 
consisting of FGF-R, PDGF-R, KDR, CCK4 , MET, TRKA AXL 
TIE, EPH, RYK, DDR, ROS, RET, LTK, ROR1, and MUSK . 

28. The polypeptide of claim 26, wherein said non- 
receptor kinase is selected from the group consisting of 
SRC BRK, BTK, CSK, ABL, ZAP70 , FES, FAK, JAK, and ACK. 

29. The polypeptide of claim 24 having the amino 
acid sequence shown in SEQ ID NO: 4. 

30. A method of using the polypeptide of claim 24 
to form a crystal, comprising the steps of: 

(a) mixing a volume of polypeptide solution 
with a reservoir solution,- and 

(b) incubating the mixture obtained in step 
(a) over the reservoir solution in a closed container, 
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under conditions suitable for crystallization. 

31. A method of obtaining an FGF receptor tyrosine 
kinase domain polypeptide in crystalline form, 
comprising the steps of: 

(a) mixing a volume of polypeptide solution 
with an equal volume of reservoir solution, wherein said 
polypeptide solution comprises 1 mg/mL to 60 mg/mL FGF- 
type tyrosine kinase domain protein, 10 mM to 200 mM 
buffering agent, 0 mM to 20 mM dithiothreitol and has a 
pH of about 5.5 to about 7.5, and wherein said reservoir 
solution comprises 10% to 30% (w/v) polyethylene glycol, 
0.1 M to 0.5 M ammonium sulfate, 0% to 20% (w/v) 
ethylene glycol or glycerol, 10 mM to 200 mM buffering 
agent and has a pH of about 5.5 to about 7.5; and 

(b) incubating the mixture obtained in step 
(a) over said reservoir solution in a closed container 
at a temperature between 0° and 25° °C until crystals 
form. 

32. The method of claim 31, wherein said 
polypeptide solution comprises about 10 mg/mL FGF 
receptor tyrosine kinase domain, about 10 mM sodium 
chloride, about 2 mM dithiothreitol, about 10 mM Tris- 
HC1 and has a pH of about 8; the reservoir buffer 
comprises about 16% (w/v) polyethylene glycol (MW 
10000), about 0.3 M ammonium sulfate, about 5% ethylene 
glycol or glycerol, about 100 mM bis-Tris and has a pH 
of about 6.5; and the temperature is about 4°C. 



33. The method of claim 31, wherein said 
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polypeptide solution comprises a compound. 

34. a CDNA encoding an FGF receptor tyrosine 
kinase domain protein, wherein a coding strand of the 
cDNA has the nucleotide sequence of SEQ ID N0:5. 

35. A method of determining three dimensional 
structures of protein tyrosine kinases with unknown 
structure comprising the step of applying structural 
atomic coordinates set forth in Table 1, Table 2, Table 
3 , or Table 4 . 

36. The method of claim 35, comprising the 
following steps: 

(a) aligning a first computer representation 
of an amino acid sequence of a protein tyrosine kinase 
of unknown structure with a second computer 
representation of a protein tyrosine kinase of known 
structure by matching homologous regions of amino acid 
sequences of said first computer representation and said 
second computer representation; 

(b) transferring computer representations of 
amino acid structures in said protein tyrosine kinase of 
known structure to computer representations of 
corresponding amino acid structures in said protein 
tyrosine kinase with unknown structure; and 

(O determining a low energy conformation of 
the protein tyrosine kinase structure resulting from 
step (b) . 

37. The method of claim 35, comprising the 
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following steps: 

(a) aligning the positions of atoms in the 
unit cell by matching electron diffraction data from two 
crystals; and 

(b) determining a low energy conformation of 
the resulting protein tyrosine kinase structure. 

38. The method of claim 35, comprising the 
following steps: 

(a) determining the secondary structure of a 
protein tyrosine kinase structure using NMR data; and 

(b) simplifying the assignment of through- 
space interactions of amino acids. 

39. The method of any one of claims 35, 36, 37, or 
38, wherein said protein tyrosine kinase with or without 
known structure is a receptor protein tyrosine kinase. 

'40. The method of claim 39, wherein said receptor 
protein tyrosine kinase with or without known structure 
is selected from the group consisting of FGF-R, PDGF-R, 
FLK, CCK4, MET, TRKA, AXL, TIE, EPH, RYK, DDR, ROS , RET, 
LTK, R0R1, and MUSK. 

41. The method of anyone of claims 35, 36, 37, or 
38, wherein said protein tyrosine kinase with or without 
known structure is a non- receptor protein tyrosine 
kinase . 

42. The method of claim 41, wherein said protein 
tyrosine kinase with or without known structure is 
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selected from the group consisting of SRC , BRK, BTK, 
CSK, ABL, ZAP70, PES, FAK, JAK, and ACK. 

43. A method of identifying a potential modulator 
of protein tyrosine kinase function by docking a 
computer representation of a structure of a compound 
wxth a computer representation of a structure of a 
cavity formed by the active-site of a protein tyrosine 
klnaSe ' WhSrein Said structure of said protein tyrosine 
kinase is defined by atomic structural coordinates set 
forth in Table 1, Table 2, Table 3, or Table 4. 

44. The method of claim 43, comprising the 
following steps.- 

(a) remov ing a computer representation of a 
compound complexed with a protein tyrosine kinase and 
docking a computer representation of a compound from a 
computer data base with a computer representation of the 
aCtlVe - site of Protein tyrosine kinase,- 

(b) deteri "ining a conformation of the complex 
resulting from step (a) with a favorable geometric fit 
and favorable complementary interactions; and 

(c) identifying compounds that best fit said 
active-site as potential modulators of protein tyrosine 

25 kinase function. 



30 



45. The method of claim 43, comprising the 
following steps: 

(a) modifying a computer representation of 
compound complexed with a protein tyrosine kinase by the 
deletion of a chemical group or groups or by the 
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addition of a chemical group or groups; 

(b) determining a conformation of the complex 
resulting from step (a) with a favorable geometric fit 
and favorable complementary interactions; and 

(c) identifying compounds that best fit the 
protein tyrosine kinase active-site as potential 
modulators of protein tyrosine kinase function. 

46. The method of claim 43, wherein said method 
comprises the following steps: 

(a) removing a computer representation of a 
compound complexed with a protein tyrosine kinase; and 

(b) searching a data base for data base 
compounds similar to said compounds using a compound 
searching computer program or replacing portions of said 
compound with similar chemical structures from a data 
base using a compound construction computer program. 

47. The method of any one of claims 43, 44, 45, or 
46, wherein said protein tyrosine kinase is a receptor 
protein tyrosine kinase . 

48. The method of claim 47, wherein said receptor 
protein tyrosine kinase is selected from the group 
consisting of FGF-R, PDGF-R, FLIC, CCK4 , MET, TRKA, AXL, 
TIE, EPH, RYK, DDR, ROS, RET, LTK, R0R1 , and MUSK. 

49. The method of anyone of claims 43, 44, 45, or 
46, wherein said protein tyrosine kinase is a non- 
receptor protein tyrosine kinase. 
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50. The method of claim 49, wherein said protein 
tyrosine kinase is selected from the group consisting of 
SRC BRK, BTK, CSK, ABL, 2AP70, FES , FAK, JAK, and ACK. 

51. a potential modulator of protein tyrosine 
kinase function identified by the method of any one of 
claims 43, 44, 45, or 46. 

52. The potential modulator of claim 51, wherein 
m ° dulator i- Elected from a computer data base. 

53. The potential modulator of claim 51, wherein 

said modulator is constructed f rom „ u 

yu&truccea trom chemical groups 

selected from a computer data base. 

54. The potential modulator of protein tyrosine 
kinase function of claim 51, wherein said modulat 
an indolinone compound of formula I or II: 



or is 
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A 




or a pharmaceutically acceptable salt, isomer, 
metabolite, ester, amide, or prodrug thereof, wherein 

(a) A lt A 2 , A 3/ and A 4 are independently carbon or 
nitrogen; 

(b) R 2 is hydrogen or alkyl; 

(c) R 2 is oxygen in the case of an oxindolinone or 
sulfur in the case of a thiolindolinone ; 

(d) R 3 is hydrogen; 

(e) R 4 , R 5 , R 6 , and R 7 are optionally present and are 
either (i) independently selected from the group 
consisting of hydrogen, alkyl, alkoxy, aryl, aryloxy, 
alkaryl, alkaryloxy, halogen, trihalomethyl , S(0)R, 
S0 2 NRR', S0 3 R, SR, N0 2 , NRR' , OH, CN, C(0)R, OC(0)R, 
NHC(0)R, (CH 2 ) n C0 2 R, and CONRR 1 or (ii) any two adjacent 
R 4 , R 5 , R 6 , and R 7 taken together form a fused ring with 
the aryl portion of the oxindole -based portion of the 
indolinone ; 

( f ) R 2 1 , R 3 1 , R 4 ■ , R s ' , and R 6 » are each 
independently selected from the group consisting of 
hydrogen, alkyl, alkoxy, aryl, aryloxy, alkaryl, 
alkaryloxy, halogen, trihalomethyl, S(0)R, S0 2 NRR • , S0 3 R, 
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SR. NO,. m .. OH, CN. C (0)R. OC(0, R . „c,0)R, ( c Hj ) B CO,R 
and CONRR 

(g) n is 0, l, 2, or 3; 

(h) R is hydrogen, alkyl or aryl ; 

(i) R- is hydrogen, alkyl or aryl; and 
lj) A is a five membered heteroaryl ring selected 

from the group consisting of thiophene, P y rrole 
Pyrazole, imidazole, 1, 2, 3-triazole, 1, 2 , 4-triazole 
oxazole, isoxazole, thiazole, isothiazole, furan, ! 2 3 - 
oxadiazole, 1 , 2 , 4 -oxadiazole , 1 , 2 , 5-oxadiazole , l 3 4- 
oxadiazole, 1 , 2 , 3 , 4-oxatriazole , 1 , 2 , 3 , 5-oxatriazole 
1,2,3-thiadiazole, 1 , 2 , 4 - thiadiazole , 1 , 2 , 5- thiadiazole , 

thiatriazole, and tetrazole,. optionally substituted at 
one or more positions with alkyl, alkoxy, aryl, aryloxy 
alkaryl, alkaryloxy, halogen, trihalomethyl , S(0)R 
SO.NRR-, S0 3 R, SR, N0 2 , NRR ' , OH, CN, C(0)R, OC (O) R ' 
NHC(0)R, ( CH 2 ) n CO z R or CONRR ' 

55. A method of identifying a potential modulator 
of protein tyrosine kinase function as a modulator of 
protein tyrosine kinase function, comprising the 
following steps: 

U) admini stering said potential modulator to 

25 cells; 

(b) comparing the level of protein tyrosine 
kxnase phosphorylation between cells not administered 
the potential modulator and cells administered said 
potential modulator,- and 
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(O identifying said potential modulator as a 
modulator of protein tyrosine kinase function based on 
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the difference in the level of protein tyrosine kinase 
phosphorylation . 

56 . A method of identifying a potential modulator 
of protein tyrosine kinase function as a modulator of 
protein tyrosine kinase function, wherein said method 
comprises the following steps: 

(a) administering a preparation of said 
potential modulator to cells; 

(b) comparing the rate of cell growth between 
cells not administered the modulator and cells 
administered the modulator; and 

(c) identifying said potential modulator as a 
modulator of protein tyrosine kinase function based on 
the difference in the rate of cell growth. 

57. A method of treating a disease associated with 
a protein tyrosine kinase with inappropriate activity in 
a cellular organism, wherein said method comprises the 
steps of : 

(a) administering a modulator of protein 
tyrosine kinase function to the organism, wherein said 
modulator is in an acceptable pharmaceutical 
preparation; and 

(b) activating or inhibiting the protein 
tyrosine kinase function to treat the disease. 



58. The method of any one of claims 55, 56, or 57, 
wherein said protein tyrosine kinase is a receptor 
protein tyrosine kinase . 
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59 



The method of claim 58, wherein said receptor 
protein tyrosine kinase is selected from the group 
containing FGF-R, PDGF-R, FLK , CCK4 , MET, TRKA, AXL 
TIE, EPH, RYK, DDR, ROS , RET, LTK, ROR1 , and MUSK . 

60. The method of any one of claims 55, 56, or 57, 
wherein said protein tyrosine kinase is a non-receptor ' 
protein tyrosine kinase. 

61. The method of claim 60, wherein said non- 
receptor protein tyrosine kinase is selected from a 
group consisting of SRC, BRK, BTK, CSK, ABL, ZAP70 , FES, 
FAK, JAK, and ACK. 
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DESCRIPTIO N 

CRYSTAL STRUCTURES OF A PROTEIN TYROSINE KINASE 

5 RELATED APPLICATIONS 

This application is related to U.S. Application 
Serial No. 08/701,191, by Mohammad i , et al . , entitled 
"Crystals of the Tyrosine Kinase Domain of Non- Insulin 
Receptor Tyrosine Kinases," filed August 21, 1996 (Lyon 

10 8c Lyon Docket No. 227/088) and U.S. Application Serial 
No. 60/034,168, by McMahon, et al . , entitled "Crystal 
Structures of a Protein Tyrosine Kinase Complexed with 
Compounds of the Oxindolinone/Thiolindolinone Family, " 
filed December 19, 1996 (Lyon & Lyon Docket No. 

15 221/282) , which are hereby incorporated herein by 

reference in their entirety including any drawings, 
tables, and figures. 

INTRODUCTION 

20 The present invention relates to the three 

dimensional structures of protein kinases. 

BACKGROUND OF THE INVENTION 
The following description of the background of the 
25 invention is provided simply as an aid in understanding 
the invention and is not admitted to describe or 
constitute prior art to the invention. 

Protein tyrosine kinases (PTKs) comprise a large 
and diverse class of enzymes (for a review, see 
30 Schlessinger and Ullrich, 1992, Neuron 9: 383-391) . The 
PTK family contains multiple subfamilies, one of which 
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is the fibroblast growth factor receptor (FGF-R) 
subfamily (for a review, see Givol and Yayon, 1992, 
FASEB J. 6 (15): 3362-3369). 

All PTKs enzymatically transfer a high energy 
5 phosphate from adenosine triphosphate to a tyrosine 
residue in a target protein. These phosphorylation 
events regulate cellular phenomena in signal 
transduction processes. Cellular signal transduction 
processes contain multiple steps that convert an 

10 extracellular signal into an intracellular signal. The 
intracellular signal is then converted into a cellular 
response. PTKs are components in many signal 
transduction processes. A PTK regulates the flow of a 
signal in a particular step in the process by 

15 phosphorylating a downstream molecule. The addition of 
a phosphate can either modulate the activity of the 
downstream molecule by turning it "on" or u of f " . Thus, 
aberrations in a particular PTK's activity can either 
cause overflow or underflow of the signal. Overflow of 

2 0 a signal can lead to such abnormalities as uncontrolled 
cell proliferation, which is representative of such 
disorders as cancer and angiogenesis . 

Scientists in the biomedical community are 
searching for PTK inhibitors that down-regulate overflow 

25 signal transduction pathways. In particular, small 

molecule PTK inhibitors are sought that can traverse the 
cell membrane and not become hydrolyzed in acidic 
environments. These small molecule PTK inhibitors can 
be highly bioavailable and can be administered orally to 

30 patients. 

Some small molecule PTK inhibitors have already 
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been discovered. For example, bis (monocyclic) , bicyclic 
or heterocyclic aryl compounds (PCT WO 92/20642) , 
vinylene-azaindole derivatives (PCT WO 94/14808) , 1- 
cyclopropyl~4-pyridyl-quinolones (U.S. Patent No. 
5,330,992), styryl compounds (U.S. Patent No. 
5,217,999), styryl -substituted pyridyl compounds (U.S. 
Patent No. 5,302,606), certain quinazoline derivatives 
(EP Application No. 0 566 266 Al), seleoindoles and 
selenides (PCT WO 94/03427) , tricyclic polyhydroxylic 
compounds (PCT WO 92/21660) , and benzylphosphonic acid 
compounds (PCT WO 91/15495) are described as PTK 
inhibitors . 

Although many PTK inhibitors are known, many of 
these are not specific for PTK subfamilies and will 
therefore cause multiple side-effects as therapeutics. 
Compounds of the indolinone family, however, are 
specific for the FGFR subfamily and are non- 
hydrolyzable . WO 96/4 0116, "Indolinone Compounds for 
the Treatment of Disease," published December 19, 1996, 
inventors Tang et al . Although the use of X-ray 
crystallography has provided three dimensional 
structures of other PTKs, they are not complexed with 
PTK subfamily specific, hydrolysis resistant, small 
molecules . 

Despite recent advances, the need remains in the 
art for crystallographic analysis of protein kinases, so 
that improved therapeutic molecules can be designed and 
synthesized . 



SUMMARY OF THE INVENTION 
The present invention relates to the three 
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dimensional structures of protein tyrosine kinases. The 
use of X-ray crystallography can define the three 
dimensional structure of protein tyrosine kinase at 
atomic resolution . 
5 The three dimensional structures described herein 

elucidate specific interactions between protein tyrosine 
kinases and compounds bound to them. The coordinates 
that define the three dimensional structures of protein 
tyrosine kinases are useful for determining three 
10 dimensional structures of PTKs with unknown structure. 
In addition, the coordinates are also useful for 
designing and identifying modulators of protein tyrosine 
kinase function. These modulators are potentially 
useful as therapeutics for diseases, including (but 
15 limited to) cell proliferative diseases, such as cancer, 
angiogenesis , atherosclerosis, and arthritis. 

Thus in a first aspect, the invention features a 
crystalline form of a polypeptide corresponding to the 
catalytic domain of a protein tyrosine kinase. 
20 The term "crystalline form," in the context of the 

invention, is a crystal formed from an aqueous solution 
comprising a purified polypeptide corresponding to the 
catalytic domain of a PTK. A crystalline form of a 
protein tyrosine kinase is characterized as being 
25 capable of diffracting x-rays in a pattern defined by 
one of the crystal forms depicted in Blundel et al . , 
1976, Protein Crystallography , Academic Press. A 
crystalline form of a protein kinase is not 
characterized as being capable of diffracting x-rays^in 
30 a pattern analogous to a crystalline form consisting of 
primarily salt or primarily a compound, for example. 
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